FIELD AND BACKGROUND OF THE INVENTION

The present invention relates to a method of identifying peptides of a
desired origin and which are capable of binding to MHC molecules of a
particular haplotype; peptides identified by the method; pharmaceutical
compositions containing the peptides; databases describing the peptides; and
the use of the peptides in vaccination.

The following abbreviations are used herein: MHC, Major
Histocompatibility Complex; β2m, β2-microglobulin; ESI, electrospray
ionization; MS, mass spectrometry; m/z, mass/charge ratio; CID, collision
induced disintegration; MS/MS, tandem mass spectrometry; MTDM, DNA
methyl transferase; FAS, fatty acid synthase; CTL, cytotoxic T lymphocytes;
mAbs, monoclonal antibodies.

The MHC serves as a shuttle to transport and display peptide antigens
on the surface of cells as an indication to the immune system of the health state
of the cells. Each individual has at most six different MHC class-I haplotypes,
out of the hundreds known. MHC bound peptides, i.e., peptides bound to, and
presented in context of, MHC molecules, originate from proteolysis of most of
the proteins expressed in the cells. Therefore, unique sets of MHC bound
peptides are displayed by each of the different MHC haplotypes according to
the protein expression and degradation schemes of the cells and according to
the peptide binding motifs of the MHC molecules (reviewed in [1]).
Therefore, thousands of different peptides are presented by the different MHC
class-I haplotypes and each of the peptides is presented in vastly differing
copy numbers per cell [2]. When cells become infected, some of the presented
peptides are derived from the pathogen's proteins, and so indicate to
circulating T-cells to kill the diseased cells and prevent the spread of the
disease.

Each MHC haplotype recognizes the peptides through a broadly defined
consensus motif of the peptide's amino acids, strategically positioned to serve
as anchors to the appropriate binding pockets on the MHC molecule. The
binding motifs of many of the MHC haplotypes were first established by pool
Edman sequencing of unfractionated peptide mixtures eluted from
immunoaffinity purified MHC molecules [3, 4]. The consensus was further
extended by direct biochemical analysis of individual peptides separated by
reversed phase chromatography and analyzed by tandem mass spectrometry [2,
5, 6], reviewed in [7].

MHC bound peptides derived from cancer specific or associated
proteins or antigens were extensively searched for, with the goal of finding
among them peptide candidates for development of anti-cancer vaccines. A
number of such tumor specific peptides were already identified and some were
successfully tested as anti-cancer vaccines for human treatment, most notably
for immunotherapy of melanoma [8, 9]. Three main approaches were
extensively used for the identification of such MHC bound peptides [10]. The
genetic approach involves transfection of cDNA libraries, made from tumor
cells, into cells that present the MHC allele of interest. The clones of
transfected cells that stimulated CTL lines against the tumor cells were
selected as the source for the tumor antigen and the genes were further
fragmented to isolate the regions of the genes that encode the particular
immunogenic peptide [11]. The second approach is based on exploiting the
known consensus binding motifs of the MHC haplotype of interest to scan
sequences of known proteins "in silico" and to predict putative MHC bound
peptides that fit this consensus [12]. For successful prediction, these
consensus motifs should be a priori well established, which is not the case for
many of the MHC haplotypes [13]. The drawback of this approach is its
reliance on chemical synthesis of a large number of peptides, only few of
which end up being useful. The third, biochemical approach involves the
fractionation of the MHC bound peptides by chromatography, assaying the
fractions for immunological activity and sequencing the individual peptides
in the active fractions [2, 5]. The biochemical approach is the only possible
way to identify post-translationally modified peptides, which are not always
predictable from the protein sequences [14-16]. The biochemical approach
depends on the availability of advanced mass spectrometry, needed for
analyzing the minute amounts of peptides that are present in very complex
mixtures (reviewed in [7]).
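
The second, "in silico" approach described above can be sketched in a few
lines of Python. This is only an illustration under stated assumptions: the
anchor motif used here (Leu or Met at position 2, Val or Leu at the C-terminus
of a 9-mer) is a simplified stand-in and is not taken from this document, and
the toy protein sequence is hypothetical, constructed merely to contain the
peptide GLIEKNIEL discussed herein.

```python
import re

# ASSUMPTION: simplified, illustrative 9-mer anchor motif (Leu/Met at
# position 2, Val/Leu at position 9). Real consensus motifs are broader
# and haplotype specific. The lookahead lets overlapping 9-mers be found.
ANCHOR_MOTIF = re.compile(r"(?=(.[LM].{6}[VL]))")


def scan_for_candidates(protein_sequence):
    """Return (1-based start position, 9-mer) pairs that fit the motif."""
    sequence = protein_sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in ANCHOR_MOTIF.finditer(sequence)]


# Hypothetical toy sequence; not a real protein.
toy_protein = "MKTGLIEKNIELAAPW"
candidates = scan_for_candidates(toy_protein)
print(candidates)
```

Every candidate returned by such a scan would still have to be synthesized and
tested, which is the drawback of the motif-based approach noted above.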

All these approaches for identifying MHC bound peptides eventually
rely on chemical synthesis of the peptides of interest to test their capacity to
bind to the MHC molecule by stabilization of empty MHC molecules on the cell
surface [17], and their potential to elicit an immune response by tetramer
assays [18], ELISPOT [19] and elicitation of CTL responses when presented on
cells [20].

Currently, sequencing and identification of individual MHC bound
peptides by the direct biochemical approach is most effectively performed by
use of tandem mass spectrometry. The peptides are resolved by reversed phase
chromatography and the eluting peptides are collected, assayed for biological
activity and sequenced, most often by electrospray tandem mass spectrometry
[2, 5, 21]. Comparing the patterns of MHC bound peptides recovered from
[...]
analysis, its ability to analyze complex mixtures of peptides and its high
sensitivity [7]. The biochemical analysis involves the purification of the MHC
molecules with their bound peptides by immunoaffinity chromatography using
mAbs specific for the native MHC [2]. To this end, the cells are solubilized
with detergents, the desired MHC molecules are purified with their MHC
bound peptides and the MHC bound peptides are recovered by denaturation
and ultra-filtration. However, once the cells are disrupted by the detergents,
the MHC molecules become contaminated by cellular debris and detergents,
which complicates the subsequent ESI-MS/MS analysis. Moreover, such
immunoaffinity purification of desired MHC haplotypes is possible only when
specific mAbs are available, whereas for many MHC haplotypes such mAbs
are presently unavailable.

There is thus a widely recognized need for, and it would be highly
advantageous to have, a method for identifying MHC bound peptides devoid
of the above limitations.



SUMMARY OF THE INVENTION

While conceiving the present invention, it was hypothesized that MHC
bound peptides presented within the context of different MHC haplotypes on
cells of different tissues or tumor origins can be biochemically identified by
transforming the cells to express and secrete soluble MHC molecules of the
different MHC haplotypes, with the aim of biochemically identifying the MHC
bound peptides that bind to the soluble MHC molecules. Should this approach
be successful, it solves three major problems associated with the prior art
biochemical approach. First, although not excluded, there is no need for
specific mAbs for each type of MHC; rather, general mAbs such as W6/32 (anti
HLA-A, B and C) can be used to isolate the sMHC, and hence the MHC bound
peptides, from the growth medium in which the cells are grown. Second, while
the prior art approach relies on native MHC molecules, different MHC
haplotypes directing the expression of different soluble MHC molecules can
potentially be used for each of the cells, to thereby increase the repertoire of
MHC bound peptides which can be used as, for example, anti-cancer vaccines.
Third, since the cells are not disrupted and further since there is no use of
detergents, the sMHC molecules do not become contaminated by cellular
debris and detergents, which otherwise complicate the subsequent
ESI-MS/MS analysis.

According to one aspect of the present invention there is provided a 
method of identifying peptides originating from a particular cell type and being 
capable of binding to MHC molecules of a particular haplotype, the method 
comprising obtaining a cell type expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; and analyzing 
peptides bound to the soluble and secreted form of the MHC molecules of the 
particular haplotype, thereby identifying the peptides originating from the
particular cell type and being capable of binding to MHC molecules of the 
particular haplotype. 

According to further features in preferred embodiments of the invention 
described below, the cell type is a cancer cell.

According to still further features in the described preferred 
embodiments the cell type is a cancer cell line. 

According to still further features in the described preferred 
embodiments the cell type is a virus infected cell or cell line. 

According to still further features in the described preferred 
embodiments the cell type is a cell involved in a development and/or 
progression of an autoimmune disease.

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from at least one protein of interest 
and being capable of binding to MHC molecules of a particular haplotype, the
method comprising obtaining cells co-expressing the at least one protein of 
interest and a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from the at least one protein of interest and 
being capable of binding to MHC molecules of the particular haplotype.

According to further features in preferred embodiments of the invention
described below, the protein of interest is natively expressed by the cells.

According to still further features in the described preferred 
embodiments the at least one protein of interest is expressed by the cells
following transformation of the cells with nucleic acid encoding for the at least 
one protein of interest. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a tumor associated 
antigen. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a cytokine. 

According to still further features in the described preferred 
embodiments the at least one protein of interest includes a protein of a 
pathogen. 

According to still further features in the described preferred 
embodiments the soluble and secreted form of the MHC molecules includes a
polypeptide encoded by exons 5 to 8 of a murine mutant Q10b.

According to still further features in the described preferred 
embodiments analyzing the peptides bound to the soluble and secreted form of 
the MHC molecules of the particular haplotype is by mass spectrometry, mass 
charge ratio and collision induced disintegration.

According to still further features in the described preferred
embodiments identifying peptides originating from the at least one protein of 
interest and being capable of binding to MHC molecules of the particular 
haplotype is by comparison to a protein database. 
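
By way of illustration only, the comparison of a sequenced peptide to a
protein database can be sketched as an exact substring search. The function
name, the dictionary layout and the two database entries below are
hypothetical placeholders (the sequences are not real protein sequences); an
actual search would run against a full protein database and would need to
tolerate ambiguities of mass spectrometric sequencing.

```python
def find_source_proteins(peptide, protein_db):
    """Return (protein name, 1-based position) for every exact occurrence
    of `peptide` in the sequences of `protein_db` (name -> sequence)."""
    hits = []
    for name, sequence in protein_db.items():
        position = sequence.find(peptide)
        while position != -1:
            hits.append((name, position + 1))
            # Continue after this hit so repeated occurrences are reported.
            position = sequence.find(peptide, position + 1)
    return hits


# HYPOTHETICAL toy database: placeholder names and made-up sequences that
# merely embed two peptides mentioned in this document.
toy_db = {
    "protein_A": "MSTGLIEKNIELQR",
    "protein_B": "MAGVYDGEEHSVLK",
}

print(find_source_proteins("GLIEKNIEL", toy_db))
print(find_source_proteins("GVYDGEEHSV", toy_db))
```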

According to another aspect of the present invention, there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
sequences of peptides identified by the methods described herein. 

According to still another aspect of the present invention, there is 
provided a kit comprising a plurality of individual containers, each of the 
plurality of individual containers containing at least one peptide identified by 
the methods described herein. 

According to yet another aspect of the present invention there is 
provided a method of identifying peptides originating from cancer associated 
proteins and being capable of binding to MHC molecules of a particular 
haplotype, the method comprising obtaining a cancer cell type expressing a 
soluble and secreted form of the MHC molecules of the particular haplotype; 
collecting the soluble and secreted form of the MHC molecules of the 
particular haplotype; analyzing peptides bound to the soluble and secreted 
form of the MHC molecules of the particular haplotype; and identifying 
peptides originating from cancer associated proteins and being capable of
binding to MHC molecules of the particular haplotype.

According to still another aspect of the present invention there is
provided a method of identifying peptides originating from cells participating
in the development and/or progression of an autoimmune disease and being
capable of binding to MHC molecules of a particular haplotype, the method
comprising obtaining cells participating in the development and/or progression
of the autoimmune disease and expressing a soluble and secreted form of the
MHC molecules of the particular haplotype; collecting the soluble and secreted
form of the MHC molecules of the particular haplotype; analyzing peptides
bound to the soluble and secreted form of the MHC molecules of the particular
haplotype; and identifying peptides originating from proteins participating in
the development and/or progression of the autoimmune disease and being
capable of binding to MHC molecules of the particular haplotype.

According to an additional aspect of the present invention there is 
provided a method of identifying peptides originating from virus infected cells 
and being capable of binding to MHC molecules of a particular haplotype, the 
method comprising obtaining virus infected cells expressing a soluble and 
secreted form of the MHC molecules of the particular haplotype; collecting the
soluble and secreted form of the MHC molecules of the particular haplotype; 
analyzing peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype; and identifying peptides originating 
from the virus and being capable of binding to MHC molecules of the 
particular haplotype.

According to yet an additional aspect of the present invention there is
provided a method of identifying peptides originating from a particular cell
type characterized by at least one of the following: (i) cells over-expressing at
least one protein; (ii) cells characterized by induced mutations; (iii) cells of
metastases; (iv) normal or transformed cells expressing cell surface proteins,
the peptides being capable of binding to MHC molecules of a particular 
haplotype, the method comprising obtaining cells of the particular cell type 
expressing a soluble and secreted form of the MHC molecules of the particular 
haplotype; collecting the soluble and secreted form of the MHC molecules of 
the particular haplotype; analyzing peptides bound to the soluble and secreted 
form of the MHC molecules of the particular haplotype; and identifying 
peptides originating from the particular cell type and being capable of binding 
to MHC molecules of the particular haplotype. 

According to still an additional aspect of the present invention there is 
provided an electronic data storage device, storing, in a retrievable form, a 
plurality of peptides being arranged at least according to their association with 
a pathology and further according to their ability of binding to MHC molecules 
of a particular haplotype. 

According to a further aspect of the present invention there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
peptides being arranged at least according to their association with a protein of 
interest and further according to their ability of binding to MHC molecules of
a particular haplotype. 
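
As a minimal sketch of how such retrievable storage might be organized, the
Python example below indexes peptide records both by pathology and haplotype
and by protein of interest and haplotype. The class names, field names and the
pathology labels attached to the two example peptides are illustrative
assumptions and are not prescribed by the present description.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class PeptideRecord:
    sequence: str
    seq_id_no: int
    source_protein: str
    pathology: str   # ASSUMPTION: free-text label chosen for illustration
    haplotype: str


class PeptideIndex:
    """In-memory index keyed by (pathology, haplotype) and (protein, haplotype)."""

    def __init__(self):
        self._by_pathology = defaultdict(list)
        self._by_protein = defaultdict(list)

    def add(self, record):
        self._by_pathology[(record.pathology, record.haplotype)].append(record)
        self._by_protein[(record.source_protein, record.haplotype)].append(record)

    def by_pathology(self, pathology, haplotype):
        return list(self._by_pathology.get((pathology, haplotype), []))

    def by_protein(self, protein, haplotype):
        return list(self._by_protein.get((protein, haplotype), []))


index = PeptideIndex()
index.add(PeptideRecord("GLIEKNIEL", 13, "DNA methyl transferase",
                        "breast cancer", "HLA-A2"))
index.add(PeptideRecord("GVYDGEEHSV", 20, "MAGE-B2",
                        "ovarian cancer", "HLA-A2"))

print([r.sequence for r in index.by_pathology("breast cancer", "HLA-A2")])
print([r.sequence for r in index.by_protein("MAGE-B2", "HLA-A2")])
```

Keying on the pair of attributes mirrors the two arrangements described above;
a persistent implementation could use the same keys as database indices.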

According to yet a further aspect of the present invention there is 
provided a method of eliciting an immune response against a protein of interest 
in a subject having a particular MHC haplotype, the method comprising 
determining the subject's particular MHC haplotype; and administering to the 
subject an effective amount of at least one peptide derived from the protein of 
interest and which is capable of binding to MHC molecules of the particular 
haplotype.

According to still a further aspect of the present invention there is 
provided a method of treating a pathology by eliciting an immune response 
against a protein of interest in a subject having a particular MHC haplotype, 
the method comprising determining the subject's particular MHC haplotype;
and administering to the subject a therapeutically effective amount of at least one
peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

According to an additional aspect of the present invention, there is 
provided a method of eliciting an immune response against a protein of interest 
in a subject, the method comprising using an individualized in vitro assay for
determining an immune reactivity of an immune system of the subject to a 
plurality of peptides derived from the protein of interest; and administering to 
the subject an effective amount of at least one peptide derived from the protein 
of interest and which is capable of inducing predetermined sufficient immune
reactivity. 

According to further features in preferred embodiments of the invention 
described below, administering to the subject the therapeutically effective 
amount of the at least one peptide is accompanied by presenting the at least 
one peptide in context of an antigen presenting cell. 

According to still an additional aspect of the present invention, there is 
provided a peptide selected from the group consisting of SEQ ID NOs: 4-6,
10-14, 19-21, 23-37, 44-88, 90-141, 143-144, 146-173, 175-189 and 191-195,
all of which were never reported to bind MHC molecules.

According to still an additional aspect of the present invention, there is 
provided a peptide selected from the group consisting of SEQ ID NOs: 5, 9, 10 
and 25. 

According to yet an additional aspect of the present invention, there is 
provided a peptide selected from the group consisting of SEQ ID NOs: 13, 20, 
23 and 24. 

According to another aspect of the present invention, there is provided a 
pharmaceutical composition comprising, as an active ingredient, at least one of 
the peptides described herein, and a pharmaceutically acceptable carrier.
Preferably, the at least one of the peptides is presented in context of an antigen 
presenting cell.

According to further features in preferred embodiments of the invention
described below, the peptide comprises at least one modification rendering
peptides more stable in a body and/or more immunogenic.

According to still further features in the described preferred 
embodiments the at least one modification is selected from the group
consisting of peptoid modification, semipeptoid modification, cyclic peptide 
modification, N terminus modification, C terminus modification, peptide bond 
modification, backbone modification and residue modification. 

The present invention successfully addresses the shortcomings of the
presently known configurations by providing a novel method for the
identification of MHC bound peptides.

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with
reference to the accompanying drawings. With specific reference now to the
drawings in detail, it is stressed that the particulars shown are by way of
example and for purposes of illustrative discussion of the preferred
embodiments of the present invention only, and are presented in the cause of
providing what is believed to be the most useful and readily understood
description of the principles and conceptual aspects of the invention. In this
regard, no attempt is made to show structural details of the invention in more
detail than is necessary for a fundamental understanding of the invention, the
description taken with the drawings making apparent to those skilled in the art
how the several forms of the invention may be embodied in practice.
In the drawings: 

FIGs. 1A-C demonstrate the purification of soluble MHC from cancer
cells. Soluble MHC was purified by immunoaffinity from the growth
medium of 5 x 10^7 transfected cells. Purified proteins were analyzed on 10%
SDS-PAGE and stained with Coomassie blue. (1A) Purification of
HLA-A2/Q10b from MCF-7 cells. (1B) Purification of sHLA-A2 from
UCI-101 cells. (1C) Purification of sHLA-B7 from UCI-107 cells.

FIGs. 2A-C show a typical nano-capillary reversed-phase
chromatography of MHC bound peptides purified from soluble MHC from
5 x 10^7 MCF-7 breast cancer cells. (2A) The total-ion-current chromatogram
(TIC). (2B) Mass spectrum taken at the time point of 33.3 minutes. (2C)
Spectrum of the collision-induced-disintegration (CID) of the dominant
peptide in 2B, having an m/z of 1028.5, that eluted at 33.3 minutes. The
putative MHC peptide GLIEKNIEL (SEQ ID NO:13) was identified to
originate from DNA-methyl transferase.

FIGs. 3A-B show a comparison of the chromatography, the MS and the
CID spectra of the synthetic peptide GLIEKNIEL (SEQ ID NO:13) of the
DNA methyl transferase (3A) with those of the peptide of m/z 1028.5 (SEQ ID
NO:13) from the breast cancer line MCF-7 (3B).

FIGs. 4A-D demonstrate the evaluation of the correctness of the
MAGE-B2 peptide p1091 (GVYDGEEHSV, SEQ ID NO:20) by comparing
the retention time and CID spectra of the synthetic peptide (4A) to those of the
natural peptide of m/z 1091.4 (SEQ ID NO:20) from the ovarian cancer line
UCI-107 (4B). (4C) Evaluation of the binding affinity of peptide p1091 (SEQ
ID NO:20) to HLA-A2 by reconstituting it into cell surface empty MHC of
the RMA-S-HHD cells as assayed by FACS analysis. (4D) The homology
between this MAGE-B2 peptide, p1091 (SEQ ID NO:20), and two other already
known HLA-A2 peptides derived from the homologous region in the MAGE-A4,
GVYDGREHTV (SEQ ID NO:38) [27], and MAGE-A10, GLVDGMEHL
(SEQ ID NO:39) [28], proteins.

FIG. 5 shows an example of reconstitution of peptides into cell surface
MHC to test their binding and affinity as assayed by FACS analysis. Synthetic
peptides were added to 10^6 RMA-S-HHD cells to a concentration of 100 mM
followed by incubation for two hours at 26°C and two hours at 37°C. The
stability of the peptides binding to the HHD cells was measured by indirect
FACS assay after decoration for another hour with the W6/32 mAb at 4°C and
30 minutes incubation with FITC goat anti-mouse Ab at 4°C. The HLA-A2.1
peptide derived from gp100 served as a positive control and unloaded
RMA-S-HHD cells as a negative control.

FIG. 6 demonstrates a CTL assay with murine cells presenting human
MHC (EL4-HHD). Cells were loaded separately with individual peptides,
washed and injected in four groups: 1- p1028 (SEQ ID NO:13) alone; 2-
p1258 (SEQ ID NO:24) alone; 3- pool of peptides: p913 (SEQ ID NO:5), p958
(SEQ ID NO:9), p989 of CD59 (SEQ ID NO:11) and p989 of FLI (SEQ ID
NO:12); 4- peptides p1031 (SEQ ID NO:14), p1121 (SEQ ID NO:22) and
p1068 (SEQ ID NO:16). Unloaded EL4-HHD or target cells not loaded with
the peptides were used as negative controls. An effector-to-target ratio of 50:1
is shown.
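
The peptide designations used in the figure descriptions above (e.g., p1028,
p1091) track the m/z values of the peptides. As a hedged illustration, the
Python sketch below computes the monoisotopic m/z of a peptide from standard
amino acid residue masses, assuming an unmodified, singly protonated ion; the
function and constant names are illustrative only. Under these assumptions
GLIEKNIEL and GVYDGEEHSV come out at about 1028.6 and 1091.5, close to the
1028.5 and 1091.4 quoted above.

```python
# Standard monoisotopic residue masses (Da) of the 20 common amino acids.
MONOISOTOPIC_RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER_MASS = 18.010565   # H2O added for the free N- and C-termini
PROTON_MASS = 1.007276


def peptide_mz(peptide, charge=1):
    """Monoisotopic m/z of an unmodified peptide carrying `charge` protons.

    ASSUMPTION: no post-translational modifications and protonation as
    the only source of charge.
    """
    neutral_mass = sum(MONOISOTOPIC_RESIDUE_MASS[aa] for aa in peptide) + WATER_MASS
    return (neutral_mass + charge * PROTON_MASS) / charge


for sequence in ("GLIEKNIEL", "GVYDGEEHSV"):
    print(sequence, round(peptide_mz(sequence), 2))
```

Because leucine and isoleucine have identical masses, a calculation of this
kind cannot by itself distinguish peptides differing only at such positions.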

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention is of a method of identifying peptides of a desired
origin, such as tumor associated antigens, pathogen (e.g., virus, bacteria)
derived antigens, endogenous cytokines, etc., which are capable of binding to
MHC molecules of a particular haplotype. The present invention is further of
peptides identified by the method and pharmaceutical compositions containing
the peptides. Still further, the present invention is of databases describing the
peptides and the use of the peptides in vaccination to treat and/or prevent
various pathologies, cancer and autoimmune diseases in particular.

The principles and operation of the present invention may be better
understood with reference to the drawings and accompanying descriptions.

Before explaining at least one embodiment of the invention in detail, it
is to be understood that the invention is not limited in its application to the
details set forth in the following description or exemplified by the Examples.
The invention is capable of other embodiments or of being practiced or carried
out in various ways. Also, it is to be understood that the phraseology and
terminology employed herein is for the purpose of description and should not
be regarded as limiting.

According to one aspect of the present invention there is provided a 
method of identifying peptides originating from a particular cell type and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining a cell 
type expressing a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; and analyzing peptides bound to the
soluble and secreted form of the MHC molecules of the particular haplotype,
thereby identifying the peptides originating from the particular cell type and
being capable of binding to MHC molecules of the particular haplotype.

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from cancer associated proteins and 
being capable of binding to MHC molecules of a particular haplotype. The
method according to this aspect of the present invention is effected by 
obtaining a cancer cell type expressing a soluble and secreted form of the 
MHC molecules of the particular haplotype; collecting the soluble and secreted 
form of the MHC molecules of the particular haplotype; analyzing peptides
bound to the soluble and secreted form of the MHC molecules of the particular
haplotype; and identifying peptides originating from cancer associated proteins 
and being capable of binding to MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from cells participating
in the development and/or progression of an autoimmune disease and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining cells 
participating in the development and/or progression of the autoimmune disease 
and expressing a soluble and secreted form of the MHC molecules of the 
particular haplotype; collecting the soluble and secreted form of the MHC 
molecules of the particular haplotype; analyzing peptides bound to the soluble 
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from proteins participating in the development 
and/or progression of the autoimmune disease and being capable of binding to 
MHC molecules of the particular haplotype. 

According to another aspect of the present invention there is provided a 
method of identifying peptides originating from virus infected cells and being 
capable of binding to MHC molecules of a particular haplotype. The method 
according to this aspect of the present invention is effected by obtaining virus 
infected cells expressing a soluble and secreted form of the MHC molecules of 
the particular haplotype; collecting the soluble and secreted form of the MHC
molecules of the particular haplotype; analyzing peptides bound to the soluble
and secreted form of the MHC molecules of the particular haplotype; and 
identifying peptides originating from the virus and being capable of binding to 
MHC molecules of the particular haplotype. 

According to still another aspect of the present invention there is 
provided a method of identifying peptides originating from a particular cell 
type characterized by at least one of the following: (i) cells over-expressing at
least one protein; (ii) cells characterized by induced mutations; (iii) cells of 
metastases; (iv) normal or transformed cells expressing cell surface proteins, 
the peptides being capable of binding to MHC molecules of a particular 
haplotype. The method according to this aspect of the present invention is 
effected by obtaining cells of the particular cell type expressing a soluble and 
secreted form of the MHC molecules of the particular haplotype; collecting the 
soluble and secreted form of the MHC molecules of the particular haplotype; 
analyzing peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype; and identifying peptides originating 
from the particular cell type and being capable of binding to MHC molecules 
of the particular haplotype. 

In general the present invention provides a method of identifying 
peptides originating from at least one protein of interest or an unknown protein 
and being capable of binding to MHC molecules of a particular haplotype. 
The method is effected by obtaining cells co-expressing the at least one protein 
of interest or unknown protein and a soluble and secreted form of the MHC
molecules of the particular haplotype; collecting the soluble and secreted form 
of the MHC molecules of the particular haplotype; analyzing peptides bound to 
the soluble and secreted form of the MHC molecules of the particular 
haplotype; and identifying peptides originating from the at least one protein of 
interest or unknown protein and being capable of binding to MHC molecules 
of the particular haplotype. Depending to a great extent on the cell type 
employed, it will Once a peptide of an unknown protein is identified, this 
protein becomes a protein of interest. 

The protein of interest or unknown protein can be a native protein 
expressed by the cells, or the protein of interest can be expressed by the cells 
following transformation of the cells with nucleic acid encoding for the protein 
of interest using techniques well known in the art. 

The method of the present invention can thus be used to associate 
proteins of yet unknown pattern of expression with particular tissues or cell 
types, such as cancer cells. In addition, the method of the present invention 
can be used to determine whether a specific open reading frame (ORF) is
expressed or not in certain cells.

In one preferred embodiment of the present invention the cell type is a 
cancer cell or a cancer cell line. Primary cell lines, metastatic cell lines, tumor 
cell lines and normal cell lines which are suitable for implementing the method 
of the present invention are available, for example, from ATCC. Tables 1 and
2 below provide examples: 



TABLE 1 
Primary and metastatic cell lines 

Primary ATCC No. | Primary name | Disease | Tissue | Metastatic ATCC No. | Metastatic name | Tissue 
---------------------------------------------------------------------------------------------------- 
CCL-228 | SW480 | colorectal adenocarcinoma | colon | CCL-227 | SW620 | lymph node 
CRL-1864 | RF-1 | gastric adenocarcinoma | stomach | CRL-1883 | RF-48 | ascites 
CRL-1675 | WM-115 | melanoma | skin | CRL-1676 | WM-266-4 | n/a 
CRL-7425 | Hs 688(A).T | melanoma | skin | CRL-7425 | Hs 688(B).T | lymph node 



TABLE 2 
Tumor and normal cell lines 

Tumor ATCC No. | Tumor name | Cancer type | Tumor tissue source | Normal ATCC No. | Normal name | Normal tissue source 
------------------------------------------------------------------------------------------------------------------- 
CCL-256 | NCI-H2126 | carcinoma; non-small cell lung cancer | lung | CCL-258.1 | NCI-BL2126 | peripheral blood 
CRL-5868 | NCI-H1395 | adenocarcinoma | lung | CRL-5957 | NCI-BL1395 | peripheral blood 
CRL-5872 | NCI-H1437 | adenocarcinoma | lung | CRL-5958 | NCI-BL1437 | peripheral blood 
CRL-5862 | NCI-H1648 | adenocarcinoma | lung | CRL-5954 | NCI-BL1648 | peripheral blood 
CRL-5811 | NCI-H2009 | adenocarcinoma | lung | CRL-5981 | NCI-BL2009 | peripheral blood 
CRL-5985 | NCI-H2122 | adenocarcinoma | pleural effusion | CRL-5967 | NCI-BL2122 | peripheral blood 
CRL-5922 | NCI-H2087 | adenocarcinoma | lymph node (metastasis) | CRL-5965 | NCI-BL2087 | peripheral blood 
CRL-5888 | NCI-H1672 | carcinoma; classic small cell lung cancer | lung | CRL-5959 | NCI-BL1672 | peripheral blood 
CRL-5929 | NCI-H2171 | carcinoma; small cell lung cancer | lung | CRL-6969 | NCI-BL2171 | peripheral blood 
CRL-5931 | NCI-H2195 | carcinoma; small cell lung cancer | lung | CRL-5956 | NCI-BL2195 | peripheral blood 
CRL-5858 | NCI-H1184 | carcinoma; small cell lung cancer | lymph node (metastasis) | CRL-5949 | NCI-BL1184 | peripheral blood 
HTB-172 | NCI-H209 | carcinoma; small cell lung cancer | bone marrow (metastasis) | CRL-5948 | NCI-BL209 | peripheral blood 
CRL-5963 | NCI-H2107 | carcinoma; small cell lung cancer | bone marrow (metastasis) | CRL-6966 | NCI-BL2107 | peripheral blood 
HTB-120 | NCI-H128 | carcinoma; small cell lung cancer | pleural effusion | CRL-5947 | NCI-BL128 | peripheral blood 
CRL-5915 | NCI-H2052 | mesothelioma | pleural effusion | CRL-5963 | NCI-BL2052 | peripheral blood 
 | NCI-H1770 | neuroendocrine carcinoma | lymph node (metastasis) | CRL-5960 | NCI-BL1770 | peripheral blood 
HTB-128 | Hs 578T | ductal carcinoma | mammary gland; breast | HTB-125 | Hs 578Bst | mammary gland; breast 
CRL-2320 | HCC1008 | ductal carcinoma | mammary gland; breast | CRL-2319 | HCC1007 BL | peripheral blood 
CRL-2336 | HCC1954 | ductal carcinoma | mammary gland; breast | CRL-2339 | HCC1954 BL | peripheral blood 
CRL-2314 | HCC38 | primary ductal carcinoma | mammary gland; breast | CRL-2346 | HCC38 BL | peripheral blood 
CRL-2321 | HCC1143 | primary ductal carcinoma | mammary gland; breast | CRL-2352 | HCC1143 BL | peripheral blood 
CRL-2322 | HCC1187 | primary ductal carcinoma | mammary gland; breast | CRL-2323 | HCC1187 BL | peripheral blood 
CRL-2324 | HCC1385 | primary ductal carcinoma | mammary gland; breast | CRL-2325 | HCC1385 BL | peripheral blood 
CRL-2331 | HCC1599 | primary ductal carcinoma | mammary gland; breast | CRL-2332 | HCC1599 BL | peripheral blood 
CRL-2336 | HCC1937 | primary ductal carcinoma | mammary gland; breast | CRL-2337 | HCC1937 BL | peripheral blood 
CRL-2340 | HCC2157 | primary ductal carcinoma | mammary gland; breast | CRL-2341 | HCC2157 BL | peripheral blood 
CRL-2343 | HCC2218 | primary ductal carcinoma | mammary gland; breast | CRL-2363 | HCC2218 BL | peripheral blood 
CRL-7345 | Hs 574.T | ductal carcinoma | mammary gland; breast | CRL-7346 | Hs 574.Sk | skin 
CRL-7482 | Hs 742.T | scirrhous adenocarcinoma | mammary gland; breast | CRL-7481 | Hs 742.Sk | skin 
CRL-7303 | Hs 496.T | cancer | mammary gland; breast | CRL-7302 | Hs 496.Sk | skin 
CRL-7486 | Hs 748.T | cancer | mammary gland; breast | CRL-7486 | Hs 748.Sk | skin 
CRL-7365 | Hs 605.T | carcinoma | mammary gland; breast | CRL-7364 | Hs 605.Sk | skin 
CRL-7368 | Hs 606 | carcinoma | mammary gland; breast | CRL-7367 | Hs 606.Sk | skin 
CRL-1974 | COLO 829 | malignant melanoma | skin | CRL-1980 | COLO 829BL | peripheral blood 
CRL-7762 | TE 354.T | basal cell carcinoma | skin | CRL-7761 | TE 353.Sk | skin 
CRL-7680 | Hs 939.T | malignant melanoma | skin | CRL-7669 | Hs 939.Sk | skin 
CRL-7380 | Hs 600.T | melanoma | skin | CRL-7359 | Hs 600.Sk | skin 
CRL-7677 | Hs 925.T | pagetoid sarcoma | skin | CRL-7678 | Hs 925.Sk | skin 
CRL-7672 | Hs 919.T | benign osteoid osteoma | bone | CRL-7671 | Hs 919.Sk | skin 
CRL-7554 | Hs 821.T | giant cell sarcoma | bone | CRL-7553 | Hs 821.Sk | skin 
CRL-7552 | Hs 820.T | heterophilic ossification | bone | CRL-7551 | Hs 820.Sk | skin 
CRL-7444 | Hs 704.T | | bone | CRL-7443 | Hs 704.Sk | skin 
CRL-7448 | Hs 707(A).T | osteosarcoma | bone | CRL-7449 | Hs 707(B).Ep | skin 
CRL-7471 | Hs 735.T | osteosarcoma | bone | CRL-7865 | Hs 735.Sk | skin 
CRL-7571 | Hs 836.T | osteosarcoma | bone | CRL-7570 | Hs 836.Sk | skin 
CRL-7595 | Hs 860.T | osteosarcoma | bone | CRL-7519 | Hs ?91.Sk | skin 
CRL-7A22 | Hs 888.T | osteosarcoma | bone | CCL-211 | Hs 888.Lu | lung 
CRL-7626 | Hs 889.T | osteosarcoma | bone | CRL-7625 | Hs 889.Sk | skin 
CRL-7628 | Hs 890.T | osteosarcoma | bone | CRL-7627 | Hs 890.Sk | skin 
CRL-7453 | Hs 709.T | periostitis; granuloma | bone | CRL-7452 | Hs 709.Sk | skin 
CRL-7432 | Hs 696.T | adenocarcinoma | unknown | CRL-7431 | Hs 696.Sk | skin 
CRL-7A86 | Hs 789.T | transitional cell carcinoma | ureter | CRL-7518 | Hs 789.Sk | skin 
CRL-7547 | Hs 814.T | giant cell sarcoma | vertebral column | CRL-7546 | Hs 814.Sk | skin 



In another preferred embodiment of the invention, the cell type is a 
virus infected cell or cell line. Table 3 below provides examples of some 
known viruses and other pathogens and the diseases they cause: 



TABLE 3 

Diseases | Viruses and other pathogens 
-------------------------------------- 
African sleeping sickness (African trypanosomiasis) | Trypanosoma brucei 
 | HIV 
Amebiasis | Entamoeba histolytica 
BSE ("mad cow disease") and nvCJD | 
Campylobacter infections | Campylobacter 
Chagas disease (American trypanosomiasis) | Trypanosoma cruzi 
Cholera | Vibrio cholerae 
Coccidioidomycosis | Coccidioides immitis 
Cryptosporidiosis | Cryptosporidium 
Cyclosporiasis | Cyclospora 
Dengue fever | Dengue viruses 
Diphtheria, tetanus, and pertussis | Toxin-producing strains of Corynebacterium diphtheriae; Bordetella pertussis 
Encephalitis | Japanese encephalitis virus, tick-borne encephalitis, West Nile virus 
Filariasis | Wuchereria bancrofti and Brugia malayi 
Giardiasis (Giardia infection) | Giardia intestinalis 
Hantavirus pulmonary syndrome | Hantavirus 
Hepatitis | Hepatitis viruses A, B, C, E 
Histoplasmosis | Histoplasma capsulatum 
Influenza (flu) | 
Leishmaniasis | Leishmania 
Leptospirosis | Leptospira 
Lyme disease | Borrelia burgdorferi sensu stricto, B. afzelii, or B. garinii 
Malaria | Plasmodium falciparum, P. vivax, P. ovale and P. malariae 
Measles, mumps, and rubella (MMR) | 
Meningitis | Haemophilus influenzae type b, Streptococcus pneumoniae and Neisseria meningitidis 
Onchocerciasis (river blindness) | Onchocerca volvulus 
Plague | Yersinia pestis 
Poliomyelitis | 
 | Rhabdoviridae, genus Lyssavirus 
Rocky Mountain spotted fever | Rickettsia rickettsii 
Severe diarrhea | Rotavirus 
Salmonellosis | Salmonella 
Schistosomiasis | 
Shigellosis | Shigella 
Tuberculosis (TB) | Mycobacterium tuberculosis 
Typhoid fever | Salmonella serogroup Typhi 
Typhus fevers | Rickettsiae 
Chickenpox | Varicella 
Vibrio parahaemolyticus | 
Viral hemorrhagic fevers (e.g., Ebola, Lassa, Marburg, Rift Valley) | arenaviruses, filoviruses, bunyaviruses, and flaviviruses 
Yellow fever | 





In yet another preferred embodiment of the present invention, the cell 
type is a cell involved in the development and/or progression of an autoimmune 
disease, such as T or B cells, and/or of an allergic disease or condition, such as 
mast cells. 

In one example, the at least one protein of interest is a tumor associated 
antigen. The tumor associated antigen can be natively expressed by the cells 
or can be expressed by appropriately transformed cells. Table 4 below lists 
some known genes encoding proteins which were identified as tumor 
associated antigens. 



TABLE 4 

Gene Symbol | Gene Name | Location | Disorders 
---------------------------------------------- 
ABL1 | v-abl Abelson murine leukemia viral oncogene homolog 1 | 9q34.1 | Leukemia, chronic myeloid 
ABL2 | v-abl Abelson murine leukemia viral oncogene homolog 2 (arg, Abelson-related gene) | | Leukemia, acute myeloid, with eosinophilia 
AKT2 | v-akt murine thymoma viral oncogene homolog 2 | 19q13.1-q13.2 | Ovarian carcinoma 
ARHI | ras homolog gene family, member I | 1p31 | Ovarian cancer 
 | | 3p21.1 | Pancreatic cancer 
AXIN2 | axin 2 (conductin, axil) | 17q23-q24 | Colorectal cancer 
BAX | BCL2-associated X protein | 19q13.3-q13.4 | Colorectal cancer; T-cell acute lymphoblastic leukemia 
BCPR | homeo box B9 | 17p13.3 | 
BRCA1 | breast cancer 1, early onset | 17q21 | Breast cancer-1; Ovarian cancer; Breast-ovarian cancer 
BRCA2 | breast cancer 2, early onset | 13q12.3 | Breast cancer 2, early onset; Pancreatic cancer 
BRCA3 | | 11q23 | Breast cancer-3 
BRCA4 | | 13q21 | Breast cancer, type 4 
BRCAX | | 13q21 | 
BRCD1 | | 13 | Breast cancer, ductal 
BRCD2 | | 1p36 | Breast cancer, ductal 
BUB1 | budding uninhibited by benzimidazoles 1 (yeast homolog) | 2q14 | Colorectal cancer with chromosomal instability 
CDH1 | cadherin 1, type 1, E-cadherin (epithelial) | 16q22.1 | Endometrial carcinoma; Ovarian carcinoma; Breast cancer 
CLD | congenital chloride diarrhea | 7q22-q31.1 | Colon cancer; Chloride diarrhea, congenital, Finnish type 
CSF1R | colony stimulating factor 1 receptor, formerly McDonough feline sarcoma viral (v-fms) oncogene homolog | 5q33.2-q33.3 | Myeloid malignancy, predisposition to 
CTNNB1 | catenin (cadherin-associated protein), beta 1 (88kD) | 3p22-p21.3 | Colorectal cancer; Hepatoblastoma; Pilomatricoma 
CYLD | cylindromatosis (turban tumor syndrome) | 16q12-q13 | Cylindromatosis, familial 
DCC | deleted in colorectal carcinoma | 18q21.3 | Colorectal cancer 
DEK | DEK oncogene (DNA binding) | 6p23 | Leukemia, acute nonlymphocytic 
DLEC1 | deleted in lung and esophageal cancer 1 | 3p22-p21.3 | Lung cancer; Esophageal cancer 
DMBT1 | deleted in malignant brain tumors 1 | 10q25.3-q26.1 | Glioblastoma multiforme; Medulloblastoma 
DRA | down-regulated in adenoma | 7q22-q31.1 | Colon cancer; Chloride diarrhea, congenital, Finnish type 
ELAC2 | elaC (E. coli) homolog 2 | 17p | Prostate cancer, susceptibility to 
EP300 | E1A binding protein p300 | 22q13 | Colorectal cancer 
ESR1 | estrogen receptor 1 | 6q25.1 | Breast cancer; Estrogen resistance 
ETV6 | | 12p13 | Leukemia, acute lymphoblastic 
FSHR | follicle stimulating hormone receptor | 2p21-p16 | Premature ovarian failure; Ovarian sex cord tumors 
HNPCC7 | | 15q21.1 | Colorectal cancer, hereditary nonpolyposis, type 7 
HPC1 | hereditary prostate cancer 1 | 1q24-q25 | Prostate cancer, susceptibility to 
HPCX | hereditary prostate cancer, X-linked | Xq27-q28 | Prostate cancer, susceptibility to 
HRAS | v-Ha-ras Harvey rat sarcoma viral oncogene homolog | 11p15.5 | Bladder cancer 
HRPT2 | hyperparathyroidism 2 (with jaw tumor) | 1q25-q31 | Hyperparathyroidism-jaw tumor syndrome; Hyperparathyroidism 
KAI1 | kangai 1 (suppression of tumorigenicity 6, prostate; CD82 antigen (R2 leukocyte antigen, antigen detected by monoclonal and | 11p11.2 | Prostate cancer, susceptibility to 
KIT | v-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene homolog | 4q12 | Piebaldism; Mast cell leukemia; Mastocytosis with associated 
KRAS1P | v-Ki-ras1 Kirsten rat sarcoma 1 viral oncogene homolog, processed pseudogene | 12p12.1 | Colorectal adenoma; Colorectal cancer 
KRAS2 | v-Ki-ras2 Kirsten rat sarcoma 2 viral oncogene homolog | 12p12.1 | Colorectal adenoma; Colorectal cancer 
LCFS2 | mitochondrial ribosomal protein L13 | 18q11-q12 | ?Lynch cancer family syndrome II 
LCO | liver cancer oncogene | 2q14-q21 | Hepatocellular carcinoma 
MADH4 | MAD (mothers against decapentaplegic, Drosophila) homolog 4 | 18q21.1 | Pancreatic cancer; Polyposis, juvenile intestinal 
MCC | mutated in colorectal cancers | 5q21 | Colorectal cancer 
MERTK | c-mer proto-oncogene tyrosine kinase | 2q14.1 | Retinitis pigmentosa, MERTK-related 
MET | met proto-oncogene (hepatocyte growth factor receptor) | 7q31 | Renal cell carcinoma, papillary, familial and sporadic 
MGCT | | 12q22 | Male germ cell tumor 
MLH1 | mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) | 3p21.3 | Colorectal cancer, hereditary nonpolyposis, type 2 
MPL | myeloproliferative leukemia virus oncogene | 1p34 | Thrombocytopenia, congenital amegakaryocytic 
MSH2 | mutS (E. coli) homolog 2 (colon cancer, nonpolyposis type 1) | 2p22-p21 | Colorectal cancer, hereditary nonpolyposis, type 1 
MSH6 | mutS (E. coli) homolog 6 | 2p16 | Cancer susceptibility; Endometrial carcinoma; Colorectal 
MTACR1 | multiple tumor-associated chromosome region 1 | 11p15.5 | Wilms tumor, type 2; Adrenocortical carcinoma, hereditary, 202300 
MYC | v-myc avian myelocytomatosis viral oncogene homolog | 8q24.12-q24.13 | Burkitt lymphoma 
NRAS | neuroblastoma RAS viral (v-ras) oncogene homolog | 1p13.2 | Colorectal cancer 
PCAP | predisposing for prostate cancer | 1q42.2-q43 | Prostate cancer, susceptibility to 
PCBC | 3475 | 1p36 | Prostate cancer, susceptibility to 
PDGFB | platelet-derived growth factor beta polypeptide (simian sarcoma viral (v-sis) oncogene homolog) | 22q12.3-q13.1 | Meningioma, SIS-related; Dermatofibrosarcoma protuberans 
PDGFRL | platelet-derived growth factor receptor-like | 8p22-p21.3 | Hepatocellular cancer; Colorectal cancer 
PGL2 | paraganglioma or familial glomus tumors 2 | 11q13.1 | Paraganglioma, familial nonchromaffin 
PGL3 | paraganglioma or familial glomus tumors 3 | 1q21 | Paragangliomas, familial nonchromaffin, 3 
PHB | prohibitin | 17q21 | Breast cancer, sporadic 
PIK3CA | phosphoinositide-3-kinase, catalytic, alpha polypeptide | 3q26.3 | Ovarian cancer 
PMS1 | postmeiotic segregation increased (S. cerevisiae) 1 | 2q31-q33 | Colorectal cancer, hereditary nonpolyposis 
PMS2 | postmeiotic segregation increased (S. cerevisiae) 2 | 7p22 | Turcot syndrome with glioblastoma; Colorectal cancer 
PPP2R1B | protein phosphatase 2 (formerly 2A), regulatory subunit A (PR 65), beta isoform | 11q22-q24 | Lung cancer 
PRCA1 | prostate cancer 1 | 1q24-q25 | Prostate cancer, susceptibility to 
PRKCA | protein kinase C, alpha | 17q22-q23.2 | Pituitary tumor, invasive 
PTEN | phosphatase and tensin homolog (mutated in multiple advanced cancers 1) | 10q23.3 | Cowden disease; Lhermitte-Duclos syndrome 
PTPN12 | protein tyrosine phosphatase, non-receptor type 12 | 7q11.23 | Colon cancer 
RAB27A | RAB27A, member RAS oncogene family | 15q21 | Griscelli syndrome 
 | RAD51 (S. cerevisiae) homolog (E. coli RecA homolog) | 15q15.1 | Breast cancer, susceptibility to 
RAD54L | RAD54 (S. cerevisiae)-like | 1p32 | Lymphoma, non-Hodgkin; Breast cancer, invasive intraductal 
RB1 | retinoblastoma 1 (including osteosarcoma) | 13q14.1-q14.2 | Retinoblastoma; Osteosarcoma; Bladder cancer 
RET | ret proto-oncogene (multiple endocrine neoplasia and medullary thyroid carcinoma 1, Hirschsprung disease) | 10q11.2 | Multiple endocrine neoplasia IIA; Medullary thyroid 
RUNX1 | runt-related transcription factor 1 (acute myeloid leukemia 1; aml1 oncogene) | 21q22.3 | Leukemia, acute myeloid; Platelet disorder, familial, with 
SCLC1 | 354 | 3p23-p21 | Small-cell cancer of lung 
SLC22A1L | solute carrier family 22 (organic cation transporter), member 1-like | 11p15.5 | Breast cancer; Rhabdomyosarcoma; Lung 
 | solute carrier family 26, member 3 | 7q22-q31.1 | Colon cancer; Chloride diarrhea, congenital, Finnish type 
 | SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, member 1 | 22q11 | Rhabdoid tumors; Rhabdoid predisposition syndrome, familial 
SRC | v-src avian sarcoma (Schmidt-Ruppin A-2) viral oncogene homolog | | Colon cancer, advanced 
SSTR2 | somatostatin receptor 2 | 17q24 | Lung cancer, small cell 
ST11 | suppression of tumorigenicity 11 (pancreas) | 3p25 | Pancreatic endocrine tumors 
ST12 | suppression of tumorigenicity 12 (prostate) | 10pter-q11 | Prostate adenocarcinoma 
ST3 | suppression of tumorigenicity 3 | | Cervical carcinoma 
ST8 | suppression of tumorigenicity 8 (ovarian) | 6q26-q27 | Ovarian cancer, serous 
TACSTD2 | tumor-associated calcium signal transducer 2 | 1p32-q12 | Corneal dystrophy, gelatinous drop-like 
TCF7L2 | transcription factor 7-like 2 (T-cell specific, HMG-box) | 10q25.3 | Colorectal cancer 
TGFBR2 | transforming growth factor, beta receptor II | 3p22 | Colon cancer; Colorectal cancer, hereditary nonpolyposis, type 6 
THPO | thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte growth and development factor) | 3q26.3-q27 | Thrombocythemia, essential 
TNFRSF10B | tumor necrosis factor receptor superfamily, member 10b | 8p22-p21 | Squamous cell carcinoma, head and neck 
TNFRSF11A | tumor necrosis factor receptor superfamily, member 11a, activator of NFKB | 18q22.1 | Osteolysis, familial expansile; Paget disease of bone 
TNFRSF1A | tumor necrosis factor receptor superfamily, | 12p13.2 | Periodic fever, familial 
TNFRSF6 | tumor necrosis factor receptor superfamily, member 6 | 10q24.1 | Autoimmune lymphoproliferative syndrome 
TNFSF5 | tumor necrosis factor (ligand) superfamily, member 5 (hyper-IgM syndrome) | Xq26 | Immunodeficiency, X-linked, with hyper-IgM 
TNFSF6 | tumor necrosis factor (ligand) superfamily, member 6 | 1q23 | Systemic lupus erythematosus, susceptibility to 
TNF | tumor necrosis factor (TNF superfamily, member 2) | 6p21.3 | Malaria, cerebral, susceptibility to; Septic shock 
TOC | tylosis with oesophageal cancer | 17q24 | Tylosis with esophageal cancer 
TP53 | tumor protein p53 (Li-Fraumeni syndrome) | 17p13.1 | Colorectal cancer; Li-Fraumeni syndrome 
TP73 | tumor protein p73 | 1p36 | Neuroblastoma 
TSG101 | tumor susceptibility gene 101 | 11p15.2-p15.1 | Breast cancer 
VMGLOM | venous malformation with glomus cells | 1p22-p21 | Glomus tumors, multiple 
WT1 | Wilms tumor 1 | 11p13 | Wilms tumor, type 1; Denys-Drash syndrome; Frasier 
WT2 | Wilms tumor 2 | 11p15.5 | Wilms tumor, type 2; Adrenocortical carcinoma, hereditary 



In another example, the at least one protein of interest includes a 
cytokine. Many diseases, including neurodegenerative (e.g., Alzheimer's 
disease) and autoimmune (e.g., rheumatoid arthritis, multiple sclerosis and the 
like) diseases are caused or accompanied by inflammation, resulting in 
infiltration of leukocytes into the afflicted tissue(s). In these diseases 
proinflammatory cytokines and chemokines are believed to play a pivotal role 
in the attraction of leukocytes to the site of inflammation and in the initiation 
and progression of the inflammatory process. In rheumatoid arthritis, for 
example, the role of proinflammatory cytokines, particularly TNF-α and IL-1, 
in disease manifestation has been intensively studied and explored in 
experimental models that have been expanded to clinical trials. Other 
cytokines such as IL-4, TGF-β, IL-8, IL-17, IL-10, IL-11, IL-12 and IL-15 
have also been implicated in the regulation of rheumatoid arthritis. Such 
regulation can be attributed to their direct effect on disease 
manifestation, their synergistic effect with other proinflammatory 
cytokines/chemokines, or their involvement in the regulation of chemokine 
transcription and production. 

Chemokines are chemoattractant cytokines that mediate leukocyte 
attraction and recruitment at the site of inflammation. Based on the positions 
of the first two cysteines, chemokines can be divided into four highly 
conserved but distinct supergene families: C-C, C-X-C, C and C-X3-C. The 
C-C family is primarily involved in the activation of endothelium and 
chemoattraction of T cells and monocytes to the site of inflammation. The 
protective competence of anti-C-C chemokine based immunotherapy has been 
demonstrated in experimental autoimmune encephalomyelitis (EAE) and 
rheumatoid arthritis. 

Neutralizing the activity of chemokines as a way to treat various 
diseases has been explored by many researchers. For example, in a recent 
study neutralizing antibodies to epithelial neutrophil activating peptide 78 
(ENA-78) were found capable of inhibiting the development of adjuvant 
arthritis (AA) if administered before but not after the onset of disease [92]. 
In another recent study, Barnes et al. [93] used anti-human RANTES to 
ameliorate AA in the Lewis rat. Gong et al. [94] used an antagonist of Monocyte 
Chemoattractant Protein 1 (MCP-1) to inhibit arthritis in the MRL-lpr mouse 
model. Using a streptococcal cell wall induced arthritis model it has been 
shown that anti-IL-4 and anti-MCP-1 antibodies block the disease [95]. The 
same study demonstrated that neutralizing IL-4 by itself leads to a marked 
reduction in MCP-1 mRNA transcription at the autoimmune site and to 
inhibition of the development of disease, which further implicates MCP-1 in 
playing an active role in arthritis development. 

In yet another example, the at least one protein of interest includes a 
protein, e.g., a surface protein, of a pathogen, such as a viral pathogen, a 
bacterial pathogen or a parasite (either a mono- or a multicellular parasite). 

The major histocompatibility complex (MHC) is a complex of antigens 
encoded by a group of linked loci, which are collectively termed H-2 in the 
mouse and HLA in humans. The two principal classes of the MHC antigens, 
class I and class II, each comprise a set of cell surface glycoproteins which 
play a role in determining tissue type and transplant compatibility. In 
transplantation reactions, cytotoxic T-cells (CTLs) respond mainly against 
foreign class I glycoproteins, while helper T-cells respond mainly against 
foreign class II glycoproteins. 

Major histocompatibility complex (MHC) class I molecules are 
expressed on the surface of nearly all cells. These molecules function in 
presenting peptides which are mainly derived from endogenously synthesized 
proteins to CD8+ T cells via an interaction with the αβ T-cell receptor. The 
class I MHC molecule is a heterodimer composed of a 46-kDa heavy chain 
which is non-covalently associated with the 12-kDa light chain β-2 
microglobulin. Class I MHC-restricted peptides, which are traditionally 
assumed to be 8-10 amino acids long, bind to the heavy chain α1-α2 groove via 
two or three anchor residues that interact with corresponding binding pockets 
in the MHC molecule. The β-2 microglobulin chain plays an important role in 
MHC class I intracellular transport, peptide binding, and conformational 
stability [76]. For most class I molecules, the formation of a heterotrimeric 
complex consisting of the MHC class I heavy chain, peptide (self or antigenic) 
and β-2 microglobulin is required for biosynthetic maturation and cell-surface 
expression [76]. 

Research studies performed on peptide binding to class I MHC 
molecules have made it possible to define specific MHC motifs functional in 
displaying peptides derived from viral or tumor antigens that are potentially 
immunogenic and might elicit a specific response from cytotoxic T lymphocytes 
(CTLs) [77,78]. 

Soluble MHC multimers possess a high avidity for T-cells since they 
provide multi-point binding of TCRs with their MHC-peptide ligands. As 
such, multimeric forms (tetramers) of MHC-peptide complexes have been the 
center of much interest recently, because they can be used for direct 
phenotypic characterization of T cell responses in normal as well as 
pathological conditions, thus providing insight into the pathophysiology and 
mechanisms of various diseases. Recombinant soluble and secreted MHC 
class I and class II complexes, including single chain MHC, are described in 
[79-91], which are incorporated herein by reference. 

There are several thousand MHC genes, some of which have been 
cloned. Table 5 below groups the MHC genes into classes and types (6). 
The sequences of the known MHC genes can be found in the Kabat database 
(http://immuno.bme.nwu.edu/). 
TABLE 5 

Class | Type | Number of genes 
------------------------------ 
MHC class I | A, B, C | 1014 
MHC class II A chain | DR, DQ, DP | 348 
MHC class II B chain | DR, DQ, DP | 1680 



Genes encoding MHC of particular haplotypes can be readily isolated 
using techniques well known in the art and reconstructed to encode soluble 
MHC molecules essentially as exemplified in the Examples section that 
follows. Such well known techniques include, for example, PCR 
amplification, enzymatic digestion and ligation. 

According to a presently preferred embodiment of the present invention 
analyzing the peptides bound to the soluble and secreted form of the MHC 
molecules of the particular haplotype is by mass spectrometry, mass charge 
ratio and collision induced disintegration. Edman degradation can also be 
employed in certain cases where a sufficient amount of the pure peptide 
becomes available. 

The identification of the amino acid sequence of a peptide in 
accordance with the teachings of the present invention is preferably effected 
by comparison of the data collected by mass spectrometry, mass charge ratio 
and collision induced disintegration to putative data of mass spectrometry, 
mass charge ratio and collision induced disintegration of known proteins. 
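
The comparison of collected data to putative data can be illustrated by a minimal sketch, assuming standard monoisotopic residue masses (rounded), singly charged b- and y-type fragment ions, and a simple peak-counting score. The function names and the scoring rule are illustrative assumptions only; they are not the specific procedure or software used in the Examples.

```python
# Monoisotopic residue masses in Da (standard values, rounded to 5 decimals)
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # mass of H2O added to the residue sum of an intact peptide
PROTON = 1.007276   # mass of the charge carrier


def peptide_mz(seq, charge=1):
    """Putative m/z of the intact peptide ion at the given charge state."""
    neutral = sum(MONO[a] for a in seq) + WATER
    return (neutral + charge * PROTON) / charge


def fragment_ions(seq):
    """Putative singly charged b- and y-ion m/z values expected from CID."""
    b = [sum(MONO[a] for a in seq[:i]) + PROTON for i in range(1, len(seq))]
    y = [sum(MONO[a] for a in seq[-i:]) + WATER + PROTON
         for i in range(1, len(seq))]
    return {"b": b, "y": y}


def count_matches(seq, observed_mz, tol=0.5):
    """Count putative fragments found among observed peaks within tol (Da)."""
    ions = fragment_ions(seq)
    theoretical = ions["b"] + ions["y"]
    return sum(any(abs(t - o) <= tol for o in observed_mz) for t in theoretical)
```

In such a scheme, each candidate peptide derived from a known protein sequence would be scored against the observed MS/MS peak list, and the best-scoring candidate taken as the putative identification.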

As used herein in the specification and in the claims section below the 
term "peptide" includes native peptides (either degradation products or 
synthetically synthesized peptides) and further peptidomimetics, such as 
peptoids and semipeptoids which are peptide analogs, which may have, for 
example, modifications rendering the peptides more stable while in a body, or 
more immunogenic. 

Such modifications include, but are not limited to, cyclization, N 
terminus modification, C terminus modification, peptide bond modification, 
including, but not limited to, CH2-NH, CH2-S, CH2-S=O, O=C-NH, CH2-O, 
CH2-CH2, S=C-NH, CH=CH or CF=CH, backbone modification and residue 
modification. Methods for preparing peptidomimetic compounds are well 
known in the art and are specified in Quantitative Drug Design, C.A. Ramsden, 
ed., Chapter 17.2, F. Choplin, Pergamon Press (1992), which is incorporated 
by reference as if fully set forth herein. Further details in this respect are 
provided hereinunder. 

As used herein in the specification and in the claims section below the 
term "amino acid" is understood to include the 20 naturally occurring amino 
acids; those amino acids often modified post-translationally in vivo, including 
for example hydroxyproline, phosphoserine and phosphothreonine; and other 
unusual amino acids including, but not limited to, 2-aminoadipic acid, 
hydroxylysine, isodesmosine, nor-valine, nor-leucine and ornithine. 
Furthermore, the term "amino acid" includes both D- and L-amino acids. 
Further elaboration of the possible amino acids usable according to the present 
invention and examples of non-natural amino acids useful in MHC class I, type 
A2, recognizable peptide antigens are given hereinunder. Other anchor 
residues are known for other MHC molecules. 

Thus, assume the following positions (P1-P9) in a 9-mer peptide: 
P1-P2-P3-P4-P5-P6-P7-P8-P9 
The P2 and P9 positions include the anchor residues which are the main 
residues participating in binding to A2 MHC molecules. Amino acid residues 
engaging positions P2 and P9 are hydrophilic aliphatic non-charged natural 
amino acids (examples being Ala, Val, Leu, Ile, Gln, Thr, Ser, Cys, preferably 
Val and Leu) or non-natural hydrophilic aliphatic non-charged amino acids 
(examples being norleucine (Nle), norvaline (Nva), α-aminobutyric acid). 
Positions P1 and P3 are also known to include amino acid residues which 
participate or assist in binding to MHC molecules; however, these positions 
can include any amino acids, natural or non-natural. The other positions are 
engaged by amino acid residues which typically do not participate in binding; 
rather, these amino acids are presented to the immune cells. Further details 
relating to the binding of peptides to MHC molecules can be found in 
reference 117, see Table V thereof, in particular. 
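
The anchor-position rule described above can be expressed as a short check. This is a minimal sketch, assuming one-letter amino acid codes and only the natural anchor residues listed in the text (Ala, Val, Leu, Ile, Gln, Thr, Ser, Cys, with Val and Leu preferred); the function name `anchor_report` is hypothetical, and non-natural residues such as Nle or Nva are not handled.

```python
# Natural residues named in the text as acceptable at anchor positions P2 and P9
NATURAL_ANCHORS = {"A", "V", "L", "I", "Q", "T", "S", "C"}
# Residues named in the text as preferred at the anchor positions
PREFERRED_ANCHORS = {"V", "L"}


def anchor_report(peptide):
    """Report whether P2 and P9 of a 9-mer carry listed anchor residues."""
    if len(peptide) != 9:
        raise ValueError("expected a 9-mer peptide (positions P1-P9)")
    p2, p9 = peptide[1], peptide[8]  # P2 is index 1, P9 is index 8
    return {
        "P2": p2,
        "P9": p9,
        "anchors_allowed": p2 in NATURAL_ANCHORS and p9 in NATURAL_ANCHORS,
        "anchors_preferred": p2 in PREFERRED_ANCHORS and p9 in PREFERRED_ANCHORS,
    }


report = anchor_report("GLYDGMEHL")  # P2 = L, P9 = L
```

Such a check only screens the two main anchor positions; as noted above, P1 and P3 may assist binding and the remaining positions are presented to the immune cells.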

Hydrophilic aliphatic natural amino acids at P2 and P9 can be 
substituted by synthetic amino acids, preferably Nleu, Nval and/or 
α-aminobutyric acid. P9 can also be substituted by aliphatic amino acids 
of the general formula -HN(CH2)nCOOH, wherein n = 3-5, as well as by 
branched derivatives thereof, such as, but not limited to, 

-NH(CH2)n-COOH 
      | 
      R 

wherein R is, for example, methyl, ethyl or propyl, located at any one or more 
of the n carbons. 

The amino terminal residue (position P1) can be substituted by 
positively charged aliphatic carboxylic acids, such as, but not limited to, 
H2N(CH2)nCOOH, wherein n = 2-4, and H2N-C(NH)-NH(CH2)nCOOH, 
wherein n = 2-3, as well as by hydroxy lysine, N-methyl lysine or ornithine 
(Orn). Additionally, the amino terminal residue can be substituted by enlarged 
aromatic residues, such as, but not limited to, H2N-(C6H6)-CH2-COOH, 
p-aminophenyl alanine, H2N-C(NH)-NH-(C6H6)-CH2-COOH, 
p-guanidinophenyl alanine or pyridinoalanine (Pal). These latter residues may 
form hydrogen bonds with the OH moieties of the tyrosine residues at the 
MHC-I N-terminal binding pocket and, at the same time, create 
aromatic-aromatic interactions. 

Derivatization of amino acid residues at positions P4-P8, should these 
residues have a side-chain such as OH, SH or NH2, like Ser, Tyr, Lys, Cys or 
Orn, can be by alkyl, aryl, alkanoyl or aroyl. In addition, OH groups at these 
positions may also be derivatized by phosphorylation and/or glycosylation. 
These derivatizations have been shown in some cases to enhance the binding 
to the T cell receptor. 

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Longer derivatives in which the second anchor amino acid is at position 
P10 may include at P9 most L amino acids. In some cases shorter derivatives 
are also applicable, in which the C terminal acid serves as the second anchor 
residue. 

Cyclic amino acid derivatives can engage positions P4-P8, preferably 
positions P6 and P7. Cyclization can be obtained through amide bond 
formation, e.g., by incorporating Glu, Asp, Lys, Orn, di-amino butyric (Dab) 
acid, di-aminopropionic (Dap) acid at various positions in the chain (-CO-NH 
or -NH-CO bonds). Backbone to backbone cyclization can also be obtained 
through incorporation of modified amino acids of the formulas 
H-N((CH2)n-COOH)-C(R)H-COOH or H-N((CH2)n-COOH)-C(R)H-NH2, 
wherein n = 1-4, and further wherein R is any natural or non-natural side chain 
of an amino acid. As stated above, the data presented herein relates to the 
residues of the most abundant MHC molecule, MHC class I, type A2. This 
data was collected over the years via the detailed analysis of thousands of 
peptides that bind to MHC-I, A2. It will be appreciated that the method of the 
present invention allows the collection of data and analysis of peptides that 
bind to any other MHC molecule. 

Cyclization via formation of S-S bonds through incorporation of two 
Cys residues is also possible. Additional side-chain to side chain cyclization 
can be obtained via formation of an interaction bond of the formula 
-(-CH2-)n-S-CH2-C-, wherein n = 1 or 2, which is possible, for example, 
through incorporation of Cys or homoCys and reaction of its free SH group 
with, e.g., bromoacetylated Lys, Orn, Dab or Dap. 

Peptide bonds (-CO-NH-) within the peptide may be substituted by 
N-methylated bonds (-N(CH3)-CO-), ester bonds (-C(R)H-C-O-O-C(R)-N-), 
ketomethylen bonds (-CO-CH2-), α-aza bonds (-NH-N(R)-CO-), wherein R is 
any alkyl, e.g., methyl, carba bonds (-CH2-NH-), hydroxyethylene bonds 
(-CH(OH)-CH2-), thioamide bonds (-CS-NH-), olefinic double bonds 
(-CH=CH-), retro amide bonds (-NH-CO-), peptide derivatives 
(-N(R)-CH2-CO-), wherein R is the "normal" side chain, naturally presented 
on the carbon atom. 

These modifications can occur at any of the bonds along the peptide 
chain and even at several (2-3) at the same time. Preferably, but not in all 
cases necessary, these modifications should exclude anchor amino acids. 

Natural aromatic amino acids, Trp, Tyr and Phe, may be substituted by 
synthetic non-natural acids such as TIC, naphthylalanine (Nol), ring-methylated 
derivatives of Phe, halogenated derivatives of Phe or o-methyl-Tyr. 

Tables 6-7 below list all of the naturally occurring amino acids (Table 
6) and some of the non-conventional or modified amino acids (Table 7). 
TABLE 6 

Amino Acid | Three-Letter Abbreviation | One-letter Symbol 
Alanine | Ala | A 
Arginine | Arg | R 
Asparagine | Asn | N 
Aspartic acid | Asp | D 
Cysteine | Cys | C 
Glutamine | Gln | Q 
Glutamic Acid | Glu | E 
Glycine | Gly | G 
Histidine | His | H 
Isoleucine | Ile | I 
Leucine | Leu | L 
Lysine | Lys | K 
Methionine | Met | M 
Phenylalanine | Phe | F 
Proline | Pro | P 
Serine | Ser | S 
Threonine | Thr | T 
Tryptophan | Trp | W 
Tyrosine | Tyr | Y 
Valine | Val | V 
Any amino acid as above | Xaa | X 
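
As an illustration only, Table 6 can be held as a simple lookup for converting a peptide written in three-letter abbreviations into one-letter symbols. The function name `to_one_letter` and the hyphen-separated input format are assumptions made for this sketch; they are not part of the disclosure.

```python
# Table 6 as a lookup: three-letter abbreviation -> one-letter symbol.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",  # any amino acid
}


def to_one_letter(peptide: str, sep: str = "-") -> str:
    """Convert e.g. 'Gly-Ile-Leu' to 'GIL'.

    The separator-delimited input format is an assumption of this sketch.
    Abbreviations are matched case-insensitively.
    """
    residues = [r.strip().capitalize() for r in peptide.split(sep) if r.strip()]
    try:
        return "".join(THREE_TO_ONE[r] for r in residues)
    except KeyError as exc:
        raise ValueError(f"unknown residue abbreviation: {exc.args[0]}") from None
```

Non-conventional residues such as those of Table 7 have no one-letter symbol, so the sketch rejects them rather than guessing.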



Table 7 

Non-conventional amino acid | Code | Non-conventional amino acid | Code 
α-aminobutyric acid | Abu | L-N-methylalanine | Nmala 
α-amino-α-methylbutyrate | Mgabu | L-N-methylarginine | Nmarg 
aminocyclopropane-carboxylate | Cpro | L-N-methylasparagine | Nmasn 
 |  | L-N-methylaspartic acid | Nmasp 
aminoisobutyric acid | Aib | L-N-methylcysteine | Nmcys 
aminonorbornyl-carboxylate | Norb | L-N-methylglutamine | Nmgln 
 |  | L-N-methylglutamic acid | Nmglu 
cyclohexylalanine | Chexa | L-N-methylhistidine | Nmhis 
cyclopentylalanine | Cpen | L-N-methylisoleucine | Nmile 
D-alanine | Dal | L-N-methylleucine | Nmleu 
D-arginine | Darg | L-N-methyllysine | Nmlys 
D-aspartic acid | Dasp | L-N-methylmethionine | Nmmet 
D-cysteine | Dcys | L-N-methylnorleucine | Nmnle 
D-glutamine | Dgln | L-N-methylnorvaline | Nmnva 
D-glutamic acid | Dglu | L-N-methylornithine | Nmorn 
D-histidine | Dhis | L-N-methylphenylalanine | Nmphe 
D-isoleucine | Dile | L-N-methylproline | Nmpro 
D-leucine | Dleu | L-N-methylserine | Nmser 
D-lysine | Dlys | L-N-methylthreonine | Nmthr 
D-methionine | Dmet | L-N-methyltryptophan | Nmtrp 
D-ornithine | Dorn | L-N-methyltyrosine | Nmtyr 
D-phenylalanine | Dphe | L-N-methylvaline | Nmval 
D-proline | Dpro | L-N-methylethylglycine | Nmetg 
D-serine | Dser | L-N-methyl-t-butylglycine | Nmtbug 
D-threonine | Dthr | L-norleucine | Nle 
D-tryptophan | Dtrp | L-norvaline | Nva 
D-tyrosine | Dtyr | α-methyl-aminoisobutyrate | Maib 
D-valine | Dval | α-methyl-γ-aminobutyrate | Mgabu 
D-α-methylalanine | Dmala | α-methylcyclohexylalanine | Mchexa 
D-α-methylarginine | Dmarg | α-methylcyclopentylalanine | Mcpen 
D-α-methylasparagine | Dmasn | α-methyl-α-naphthylalanine | Manap 
D-α-methylaspartate | Dmasp | α-methylpenicillamine | Mpen 
D-α-methylcysteine | Dmcys | N-(4-aminobutyl)glycine | Nglu 
D-α-methylglutamine | Dmgln | N-(2-aminoethyl)glycine | Naeg 
D-α-methylhistidine | Dmhis | N-(3-aminopropyl)glycine | Norn 
D-α-methylisoleucine | Dmile | N-amino-α-methylbutyrate | Nmaabu 
D-α-methylleucine | Dmleu | α-naphthylalanine | Anap 
D-α-methyllysine | Dmlys | N-benzylglycine | Nphe 
D-α-methylmethionine | Dmmet | N-(2-carbamylethyl)glycine | Ngln 
D-α-methylornithine | Dmorn | N-(carbamylmethyl)glycine | Nasn 
D-α-methylphenylalanine | Dmphe | N-(2-carboxyethyl)glycine | Nglu 
D-α-methylproline | Dmpro | N-(carboxymethyl)glycine | Nasp 
D-α-methylserine | Dmser | N-cyclobutylglycine | Ncbut 
D-α-methylthreonine | Dmthr | N-cycloheptylglycine | Nchep 
D-α-methyltryptophan | Dmtrp | N-cyclohexylglycine | Nchex 
D-α-methyltyrosine | Dmtyr | N-cyclodecylglycine | Ncdec 
D-α-methylvaline | Dmval | N-cyclododecylglycine | Ncdod 
D-N-methylalanine | Dnmala | N-cyclooctylglycine | Ncoct 
D-N-methylarginine | Dnmarg | N-cyclopropylglycine | Ncpro 
D-N-methylasparagine | Dnmasn | N-cycloundecylglycine | Ncund 
D-N-methylaspartate | Dnmasp | N-(2,2-diphenylethyl)glycine | Nbhm 
D-N-methylcysteine | Dnmcys | N-(3,3-diphenylpropyl)glycine | Nbhe 
D-N-methylleucine | Dnmleu | N-(3-indolylethyl)glycine | Nhtrp 
D-N-methyllysine | Dnmlys | N-methyl-γ-aminobutyrate | Nmgabu 
N-methylcyclohexylalanine | Nmchexa | D-N-methylmethionine | Dnmmet 
D-N-methylornithine | Dnmorn | N-methylcyclopentylalanine | Nmcpen 
N-methylglycine | Nala | D-N-methylphenylalanine | Dnmphe 
N-methylaminoisobutyrate | Nmaib | D-N-methylproline | Dnmpro 
N-(1-methylpropyl)glycine | Nile | D-N-methylserine | Dnmser 
N-(2-methylpropyl)glycine | Nleu | D-N-methylthreonine | Dnmthr 
D-N-methyltryptophan | Dnmtrp | N-(1-methylethyl)glycine | Nval 
D-N-methyltyrosine | Dnmtyr | N-methyl-α-naphthylalanine | Nmanap 
D-N-methylvaline | Dnmval | N-methylpenicillamine | Nmpen 
γ-aminobutyric acid | Gabu | N-(p-hydroxyphenyl)glycine | Nhtyr 
L-t-butylglycine | Tbug | N-(thiomethyl)glycine | Ncys 
L-ethylglycine | Etg | penicillamine | Pen 
L-homophenylalanine | Hphe | L-α-methylalanine | Mala 
L-α-methylarginine | Marg | L-α-methylasparagine | Masn 
L-α-methylaspartate | Masp | L-α-methyl-t-butylglycine | Mtbug 
L-α-methylcysteine | Mcys | L-methylethylglycine | Metg 
L-α-methylglutamine | Mgln | L-α-methylglutamate | Mglu 
L-α-methylhistidine | Mhis | L-α-methylhomophenylalanine | Mhphe 
L-α-methylisoleucine | Mile | N-(2-methylthioethyl)glycine | Nmet 
D-N-methylglutamine | Dnmgln | N-(3-guanidinopropyl)glycine | Narg 
D-N-methylglutamate | Dnmglu | N-(1-hydroxyethyl)glycine | Nthr 
D-N-methylhistidine | Dnmhis | N-(hydroxyethyl)glycine | Nser 
D-N-methylisoleucine | Dnmile | N-(imidazolylethyl)glycine | Nhis 
L-α-methylleucine | Mleu | L-α-methyllysine | Mlys 
L-α-methylmethionine | Mmet | L-α-methylnorleucine | Mnle 
L-α-methylnorvaline | Mnva | L-α-methylornithine | Morn 
L-α-methylphenylalanine | Mphe | L-α-methylproline | Mpro 
L-α-methylserine | Mser | L-α-methylthreonine | Mthr 
L-α-methyltryptophan | Mtrp | L-α-methyltyrosine | Mtyr 
L-α-methylvaline | Mval | L-N-methylhomophenylalanine | Nmhphe 
N-(N-(2,2-diphenylethyl)carbamylmethyl)glycine | Nnbhm | N-(N-(3,3-diphenylpropyl)carbamylmethyl)glycine | Nnbhe 
1-carboxy-1-(2,2-diphenylethylamino)cyclopropane | Nmbc |  |  

A peptide according to the present invention can be used in a 
self-standing form or be a part of a larger moiety such as a protein or a display 
moiety such as a display bacterium, a display phage or preferably a display 
cell. 

Additionally, a peptide according to the present invention includes at 
least five, optionally at least six, optionally at least seven, optionally at least 
eight, optionally at least nine, optionally at least ten, optionally at least eleven, 
optionally at least twelve, optionally at least thirteen, optionally at least 
fourteen, optionally at least fifteen, optionally at least sixteen or optionally at 
least seventeen, optionally between seventeen and twenty five or optionally 
between twenty five and at least thirty amino acid residues (also referred to 
herein interchangeably as amino acids). 

Accordingly, as used herein the term "amino acid" or "amino acids" is 
understood to include the 20 naturally occurring amino acids; those amino 
acids often modified post-translationally in vivo, including, for example, 
hydroxyproline, phosphoserine and phosphothreonine; and other unusual 
amino acids including, but not limited to, 2-aminoadipic acid, hydroxylysine, 
isodesmosine, nor-valine, nor-leucine and ornithine. Furthermore, the term 
"amino acid" includes both D- and L-amino acids. 

As used herein the phrase "derived from a protein" refers to peptides 
derived from the specified protein or proteins and further to homologous 
peptides derived from equivalent regions of proteins homologous to the 
specified proteins of the same or other species, provided that these peptides are 
effective as vaccines, such as anti-tumor vaccines. The term further relates to 
permissible amino acid alterations and peptidomimetics designed based on the 
amino acid sequence of the specified proteins or their homologous proteins. 

As used herein the phrase "anti-tumor vaccines" refers to vaccines 
effective in preventing the development of, or curing, cancer, including 
primary tumor and/or metastases. 

The peptides of the invention can be administered per se or as an active 
ingredient in a pharmaceutical composition which may further include a 
pharmaceutically acceptable carrier. Preferably, one or more peptides of the 
invention are presented in context of an antigen presenting cell. The most 
common cells used to load antigens are bone marrow and peripheral blood 
derived dendritic cells (DC), as these cells express costimulatory molecules 
that help activation of CTL. Nevertheless, the peptide presenting cell can also 
be a macrophage, a B cell or a fibroblast. Presenting the peptide can be 
effected by a variety of methods, such as, but not limited to, (a) transforming 
the presenting cell with at least one polynucleotide (e.g., DNA) encoding at 
least one peptide; (b) loading the presenting cell with at least one 
polynucleotide (e.g., RNA) encoding at least one peptide; (c) loading the 
presenting cell with at least one peptide (e.g., synthetic peptide); and (d) 
loading the antigen presenting cell with at least one longer polypeptide (e.g., 
purified) including at least one peptide. Loading can be external or internal. 
The polynucleotide, peptide or longer polypeptide can be fused to internalizing 
sequences, antennapedia sequences or toxoid sequences or to helper 
sequences, such as, but not limited to, heat shock protein sequences. 

As used herein a "pharmaceutical composition" refers to a preparation 
of one or more of the peptides described herein, with other chemical 
components such as pharmaceutically suitable carriers and excipients. The 
purpose of a pharmaceutical composition is to facilitate administration of a 
compound to a subject. 

Hereinafter, the term "pharmaceutically acceptable carrier" refers to a 
carrier or a diluent that does not cause significant irritation to a subject and 
does not abrogate the biological activity and properties of the administered 
compound. Examples, without limitation, of carriers are propylene glycol, 
saline, emulsions and mixtures of organic solvents with water. 

Herein the term "excipient" refers to an inert substance added to a 
pharmaceutical composition to further facilitate administration of a compound. 
Examples, without limitation, of excipients include calcium carbonate, calcium 
phosphate, various sugars and types of starch, cellulose derivatives, gelatin, 
vegetable oils and polyethylene glycols. 

According to a preferred embodiment of the present invention, the 
pharmaceutical carrier is an aqueous solution of lactic acid. 

In this respect, it should be pointed out that some of the peptides of the 
present invention, according to preferred embodiments, are readily soluble in 
aqueous media and are therefore easily formulated. 

Techniques for formulation and administration of drugs may be found 
in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, 
latest edition, which is incorporated herein by reference. 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, transdermal, intestinal or parenteral delivery, including 
intramuscular, subcutaneous and intramedullary injections as well as 
intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or 
intraocular injections. 

Pharmaceutical compositions of the present invention may be 
manufactured by processes well known in the art, e.g., by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. 

Pharmaceutical compositions for use in accordance with the present 
invention thus may be formulated in conventional manner using one or more 
pharmaceutically acceptable carriers comprising excipients and auxiliaries, 
which facilitate processing of the active compounds into preparations which 
can be used pharmaceutically. Proper formulation is dependent upon the route 
of administration chosen. 

For injection, the peptides of the invention may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as 
Hank's solution, Ringer's solution, or physiological saline buffer with or 
without organic solvents such as propylene glycol, polyethylene glycol and the 
like. For transmucosal administration, penetrants are used in the formulation. 
Such penetrants are generally known in the art. 

For oral administration, the peptides can be formulated readily by 
combining the active compounds with pharmaceutically acceptable carriers 
well known in the art. Such carriers enable the peptides of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like, for oral ingestion by a patient. Pharmacological 
preparations for oral use can be made using a solid excipient, optionally 
grinding the resulting mixture, and processing the mixture of granules, after 
adding suitable auxiliaries if desired, to obtain tablets or dragee cores. 

Suitable excipients are, in particular, fillers such as sugars, including 
lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for 
example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum 
tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose; and/or physiologically acceptable polymers such as 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, 
such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. 

Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used which may optionally contain gum 
arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium 
dioxide, lacquer solutions and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for 
identification or to characterize different combinations of active compound 
doses. 

Pharmaceutical compositions which can be used orally include push-fit 
capsules made of gelatin as well as soft, sealed capsules made of 
gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules 
may contain the active ingredients in admixture with filler such as lactose, 
binders such as starches, lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as aqueous solution, fatty oils, 
liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be 
added. All formulations for oral administration should be in dosages suitable 
for the chosen route of administration. 

For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

For administration by inhalation, the peptides are conveniently 
delivered in the form of an aerosol spray presentation from a pressurized pack 
or a nebulizer with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane 
or carbon dioxide. In the case of a pressurized aerosol, the dosage unit may be 
determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of, e.g., gelatin for use in an inhaler or insufflator may be 
formulated containing a powder mix of the compound and a suitable powder 
base such as lactose or starch. 

The peptides described herein may be formulated for parenteral 
administration, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampoules or in 
multidose containers with, optionally, an added preservative. The compositions 
may be suspensions, solutions or emulsions in oily or aqueous vehicles, and 
may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. 

Pharmaceutical compositions for parenteral administration include 
aqueous solutions of the active compound in water-soluble form. 
Additionally, suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles 
include fatty oils such as sesame oil, or synthetic fatty acid esters such as 
ethyl oleate, triglycerides or liposomes. Aqueous injection suspensions may 
contain substances which increase the viscosity of the suspension, such as 
sodium carboxymethyl cellulose, sorbitol or dextran. Optionally, the 
suspension may also contain suitable stabilizers or agents which increase the 
solubility of the peptides to allow for the preparation of highly concentrated 
solutions. 

Alternatively, the active ingredient may be in powder form for 
constitution with a suitable vehicle, e.g., sterile, pyrogen-free water, before 
use. 

The peptides of the present invention may also be formulated in rectal 
compositions such as suppositories or retention enemas, using, e.g., 
conventional suppository bases such as cocoa butter or other glycerides. 

The pharmaceutical compositions herein described may also comprise 
suitable solid or gel phase carriers or excipients. Examples of such carriers or 
excipients include, but are not limited to, calcium carbonate, calcium 
phosphate, various sugars, starches, cellulose derivatives, gelatin and polymers 
such as polyethylene glycols. 

Pharmaceutical compositions suitable for use in context of the present 
invention include compositions wherein the active ingredients are contained in 
an amount effective to achieve the intended purpose. More specifically, a 
therapeutically effective amount means an amount of peptide effective to 
prevent, alleviate or ameliorate symptoms of pathology or prolong the survival 
of the subject being treated. 

Determination of a therapeutically effective amount is well within the 
capability of those skilled in the art, especially in light of the detailed 
disclosure provided herein. 

For any peptide used in the methods of the invention, the therapeutically 
effective amount or dose can be estimated initially from activity assays in cell 
cultures and/or animals. For example, a dose can be formulated in animal 
models to achieve a circulating concentration range that includes the IC50 as 
determined by activity assays (e.g., the concentration of the test compound 
which achieves a half-maximal inhibition of the proliferation activity). Such 
information can be used to more accurately determine useful doses in humans. 
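
By way of a hedged illustration, the IC50 referred to above can be read off a measured dose-response series by interpolating between the two concentrations that bracket 50 % inhibition. The sketch below assumes log-linear interpolation and a hypothetical function name `estimate_ic50`; the text does not prescribe any particular fitting method, and a full sigmoidal fit may be preferred in practice.

```python
import math


def estimate_ic50(concentrations, inhibitions):
    """Estimate IC50 by linear interpolation on a log10 concentration axis.

    concentrations: increasing, positive concentrations (any consistent unit).
    inhibitions: fraction of proliferation inhibited (0.0 to 1.0) at each one.
    Returns None if the data never cross 50 % inhibition.
    """
    points = list(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        # Look for the first rising segment that brackets 50 % inhibition.
        if i_lo <= 0.5 <= i_hi and i_hi > i_lo:
            frac = (0.5 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10 ** log_ic50
    return None
```

The concentrations passed in are whatever the activity assay produced; no values from this document are implied.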

Toxicity and therapeutic efficacy of the peptides described herein can 
be determined by standard pharmaceutical procedures in experimental animals, 
e.g., by determining the IC50 and the LD50 (lethal dose causing death in 50 % 
of the tested animals) for a subject compound. The data obtained from these 
activity assays and animal studies can be used in formulating a range of dosage 
for use in humans. 

The dosage may vary depending upon the dosage form employed and 
the route of administration utilized. The exact formulation, route of 
administration and dosage can be chosen by the individual physician in view of 
the patient's condition. (See e.g., Fingl, et al., 1975, in "The Pharmacological 
Basis of Therapeutics", Ch. 1 p. 1). 

Dosage amount and interval may be adjusted individually to provide 
plasma levels of the active moiety which are sufficient to maintain therapeutic 
effects, termed the minimal effective concentration (MEC). The MEC will 
vary for each preparation, but can be estimated from in vitro and/or in vivo 
data, e.g., the concentration necessary to achieve 50-90 % inhibition of 
proliferation of certain cells may be ascertained using the assays described 
herein. Dosages necessary to achieve the MEC will depend on individual 
characteristics and route of administration. HPLC assays or bioassays can be 
used to determine plasma concentrations. 

Dosage intervals can also be determined using the MEC value. 
Preparations should be administered using a regimen which maintains plasma 
levels above the MEC for 10-90 % of the time, preferably between 30-90 % 
and most preferably 50-90 %. 

Depending on the severity and responsiveness of the condition to be 
treated, dosing can also be a single administration of a slow release 
composition described hereinabove, with course of treatment lasting from 
several days to several weeks or until cure is effected or diminution of the 
disease state is achieved. 
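
The regimen criterion stated above (plasma levels above the MEC for a given fraction of the time) can be checked against sampled plasma concentrations, for example from the HPLC assays mentioned. The following is a minimal sketch assuming linear interpolation between samples; the function name `fraction_above_mec` and the sampling format are illustrative assumptions, not part of the disclosure.

```python
def fraction_above_mec(times, concentrations, mec):
    """Fraction of the sampled interval with plasma level at or above `mec`.

    times: increasing sample times (any consistent unit).
    concentrations: plasma concentration measured at each time.
    Levels between samples are linearly interpolated (an assumption).
    """
    if len(times) != len(concentrations) or len(times) < 2:
        raise ValueError("need at least two matching samples")
    total = times[-1] - times[0]
    above = 0.0
    segments = zip(times, times[1:], concentrations, concentrations[1:])
    for t0, t1, c0, c1 in segments:
        dt = t1 - t0
        if c0 >= mec and c1 >= mec:
            above += dt                      # whole segment at or above MEC
        elif c0 < mec and c1 < mec:
            continue                         # whole segment below MEC
        else:
            # The segment crosses the MEC exactly once.
            crossing = (mec - c0) / (c1 - c0)  # position in [0, 1]
            above += dt * (1 - crossing) if c1 > c0 else dt * crossing
    return above / total
```

A regimen would then satisfy the most preferred range if the returned value lies between 0.5 and 0.9.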

The amount of a composition to be administered will, of course, be 
dependent on the subject being treated, the severity of the affliction, the 
manner of administration, the judgment of the prescribing physician, etc. 

Compositions of the present invention may, if desired, be presented in a 
pack or dispenser device, such as a U.S. Food and Drug Administration (FDA) 
approved kit which may contain one or more unit dosage forms containing the 
active ingredient. The pack may, for example, comprise metal or plastic foil, 
such as a blister pack. The pack or dispenser device may be accompanied by 
instructions for administration. The pack or dispenser may also be 
accompanied by a notice associated with the container in a form prescribed by 
a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals, which notice is reflective of approval by the agency of the 
form of the compositions for human or veterinary administration. Such notice, 
for example, may be of labeling approved by the U.S. Food and Drug 
Administration for prescription drugs or of an approved product insert. 
Compositions comprising a chemical conjugate of the invention formulated in 
a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition, such 
as a cancer of a certain type, an autoimmune disease or an allergy. 

Peptides of the present invention may be packaged in kits, each such kit 
comprising a plurality of individual containers, each of which contains at 
least one peptide identified by the method of the present invention. Such a kit 
can be used for two purposes. First, an in vitro functional assay, such as the 
CTL assay [20] or the ELISPOT assay [19] of, for example, cytokine (e.g., 
IL-2, TNF alpha or interferon gamma) production or development of 
cytotoxicity using immune cells derived from a patient can be used to 
determine the immune response of the patient to each one of the peptides, 
which response to a large extent depends on the particular MHC haplotype of 
the patient. Second, once a reactive peptide or peptides are identified, either 
by an individualized in vitro assay or from in silico data as is further detailed 
below, a suitable peptide or peptides from the kit can be administered so as to 
treat the patient. 

According to another aspect of the present invention there is provided 
an electronic data storage device, storing, in a retrievable form, a plurality of 
sequences of peptides identified by the method described herein. Various 
other parameters, such as the parameters identified in the Tables provided in 
the Examples section that follows, can also be linked to the peptide sequences, 
in, for example, a table form or any other form. Preferably, the plurality of 
peptides are arranged at least according to their association with a pathology 
and further according to their ability to bind to MHC molecules of a 
particular haplotype. This in silico data can be used instead of or in addition to 
the in vitro assays described above to match a most active peptide to treat a 
pathology of a certain patient having a particular pre-identified MHC 
haplotype. Thus, look up tables associating a peptide with a protein, with a 
gene, with a disease, with a haplotype, and/or with an efficiency score can be 
constructed and used to best suit a peptide for treatment of a disease in an 
individualized way taking into account the MHC haplotype of the patient to be 
treated. Of course, individualized in vitro assays can be used to ascertain 
peptide selection. 
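
The look up table described above can be sketched as a list of records filtered by disease and haplotype and ranked by efficiency score. The record fields, the function name `best_peptides`, and the convention that a higher score is better are assumptions made for illustration; the entries shown are placeholders, not data from this document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PeptideRecord:
    sequence: str    # peptide in one-letter symbols
    protein: str     # protein of origin
    gene: str        # gene encoding that protein
    disease: str     # associated pathology
    haplotype: str   # MHC haplotype the peptide binds
    score: float     # efficiency score (higher is better; assumed)


def best_peptides(records, disease, haplotype, top=3):
    """Return up to `top` records matching disease and haplotype, best first."""
    matches = [
        r for r in records
        if r.disease == disease and r.haplotype == haplotype
    ]
    return sorted(matches, key=lambda r: r.score, reverse=True)[:top]


# Placeholder entries for illustration only.
RECORDS = [
    PeptideRecord("AAAAAAAAL", "protein-1", "gene-1", "disease-X", "A2", 0.4),
    PeptideRecord("GGGGGGGGV", "protein-2", "gene-2", "disease-X", "A2", 0.9),
    PeptideRecord("SSSSSSSSL", "protein-1", "gene-1", "disease-X", "B7", 0.7),
    PeptideRecord("TTTTTTTTV", "protein-3", "gene-3", "disease-Y", "A2", 0.8),
]
```

For a patient typed as the placeholder haplotype "A2" with "disease-X", `best_peptides(RECORDS, "disease-X", "A2")` returns the two matching records with the higher-scoring one first.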

The electronic data storage device can, for example, be an 
electromagnetically or electro-optically readable device and it preferably forms 
a part of a server that is accessible by users through a communications 
network, such as the Internet, an intranet or an extranet, via a plurality of user 
clients at the disposal of the users. A management software application 
manages the data stored in the data storage device and is preferably designed 
to support search and retrieval of information from the database and deposition 
of information into the database. 

Thus, further according to the present invention there is provided a 
method of eliciting an immune response against a protein of interest in a 
subject having a particular MHC haplotype. The method according to this 
aspect of the invention is effected by determining the subject's particular MHC
haplotype; and administering to the subject an effective amount of at least one
peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

Still further according to the present invention there is provided a 
method of eliciting an immune response against a protein of interest in a 
subject. The method is effected by using an individualized in vitro assay for 
determining an immune reactivity of an immune system of the subject to a 
plurality of peptides derived from the protein of interest; and administering to 
the subject an effective amount of at least one peptide derived from the protein 
of interest and which is capable of inducing predetermined sufficient immune
reactivity. 

According to another aspect the present invention provides a method of 
treating a pathology by eliciting an immune response against a protein of 
interest in a subject having a particular MHC haplotype. The method is 
effected by determining the subject's particular MHC haplotype; and 
administering to the subject a therapeutically effective amount of at least one
peptide derived from the protein of interest and which is capable of binding to 
MHC molecules of the particular haplotype. 

As used herein the term "treating" includes prevention or cure of a
pathology, such as a disease, syndrome or manifestation, effected by inhibiting,
slowing or reversing the progression of the disease, syndrome or manifestation,
substantially ameliorating clinical symptoms of a disease, syndrome or
manifestation or substantially preventing the appearance of clinical symptoms
of a disease, syndrome or manifestation.

As used herein the term "subject" refers to humans and animals having 
an MHC system, such as the HLA system in humans, in particular farm 
animals. It will be appreciated in this respect that the method of the present
invention can be used to improve all kinds of peptide immunization via 
individualization for both human beings and animals. 

A variety of pathologies can be treated using the peptides of the present 
invention, including, but not limited to, cancers, infections, inflammations, 
autoimmune diseases, allergies, etc. The gist of the present invention with
respect to treating pathologies lies in the fact that the present invention offers,
for the first time, individualization of the vaccine to the MHC haplotype of the 
treated subject. 

Additional objects, advantages, and novel features of the present 
invention will become apparent to one ordinarily skilled in the art upon
examination of the following examples, which are not intended to be limiting.
Additionally, each of the various embodiments and aspects of the present
invention as delineated hereinabove and as claimed in the claims section below
finds experimental support in the following examples.

EXAMPLES

Reference is now made to the following examples, which together with
the above descriptions, illustrate the invention in a non-limiting fashion.
Generally, the nomenclature used herein and the laboratory procedures
utilized in the present invention include molecular, biochemical,
microbiological and recombinant DNA techniques. Such techniques are
thoroughly explained in the literature. See, for example, "Molecular Cloning:
A Laboratory Manual" Sambrook et al., (1989); "Current Protocols in
Molecular Biology" Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al.,
"Current Protocols in Molecular Biology", John Wiley and Sons, Baltimore,
Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John
Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA",
Scientific American Books, New York; Birren et al. (eds) "Genome Analysis:
A Laboratory Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory
Press, New York (1998); methodologies as set forth in U.S. Pat. Nos.
4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; "Cell Biology: A
Laboratory Handbook", Volumes I-III Cellis, J. E., ed. (1994); "Culture of
Animal Cells - A Manual of Basic Technique" by Freshney, Wiley-Liss, N. Y.
(1994), Third Edition; "Current Protocols in Immunology" Volumes I-III
Coligan J. E., ed. (1994); Stites et al. (eds), "Basic and Clinical
Immunology" (8th Edition), Appleton & Lange, Norwalk, CT (1994); Mishell
and Shiigi (eds), "Selected Methods in Cellular Immunology", W. H.
Freeman and Co., New York (1980); available immunoassays are extensively
described in the patent and scientific literature, see, for example, U.S. Pat.
Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517;
3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876;
4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M. J.,
ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S. J.,
eds. (1985); "Transcription and Translation" Hames, B. D., and Higgins S. J.,
eds. (1984); "Animal Cell Culture" Freshney, R. I., ed. (1986); "Immobilized
Cells and Enzymes" IRL Press, (1986); "A Practical Guide to Molecular
Cloning" Perbal, B., (1984) and "Methods in Enzymology" Vol. 1-317,
Academic Press; "PCR Protocols: A Guide To Methods And Applications",
Academic Press, San Diego, CA (1990); Marshak et al., "Strategies for Protein
Purification and Characterization - A Laboratory Course Manual" CSHL Press
(1996); all of which are incorporated by reference as if fully set forth herein.
Other general references are provided throughout this document. The
procedures therein are believed to be well known in the art and are provided
for the convenience of the reader. All the information contained therein is
incorporated herein by reference.

MATERIALS AND EXPERIMENTAL PROCEDURES 
Cell lines: 

The human cancer cell lines PC3 (prostate cancer), UCI-107 and
UCI-101 (both ovarian cancer), MDA-231 and MCF-7 (both breast cancer)
were obtained from the ATCC. The human B-cell line C1R was a generous
gift from Nick Zavazava. UCI-107, UCI-101, MDA-231 and MCF-7 cells
were maintained in DMEM containing 10 % FCS, 1 mM glutamine, 0.1 mg/ml
streptomycin and 100 units/ml penicillin. PC3 and C1R cells were maintained
in RPMI 1640 containing 10 % FCS, 1 mM glutamine, 0.1 mg/ml
streptomycin and 100 units/ml penicillin. For growing MCF-7 cells without
estrogen, the cells were maintained in DMEM without sodium pyruvate and
phenol red and containing 4 % FCS stripped of estrogen, 1 mM glutamine, 0.1
mg/ml streptomycin and 100 units/ml penicillin. Culture media and serum
were obtained from GibcoBRL.

RMA-S-HHD is a murine TAP-2 deficient lymphoma clone of
C57BL/6 origin, transfected with the HLA-A2.1/Db-β2m single chain (HHD)
[23]. RMA-S-HHD-B7.1 cells are RMA-S-HHD cells transfected with the murine
B7.1 costimulatory molecule (CD80). EL4-HHD is the murine lymphoma cell line
EL4 transfected with HHD. RMA-S-HHD, RMA-S-HHD-B7.1 and EL4-HHD
were maintained in RPMI 1640 containing 10 % FCS, 1 mM glutamine,
0.1 mg/ml streptomycin and 100 units/ml penicillin. After transfection, the
cells were maintained in medium supplemented with 500 to 1000 µg/ml of the
antibiotic G418 (GibcoBRL).
DNA:

Plasmid HLA-A2/Q10b, used for expression of soluble MHC, contains
the first five exons of HLA-A2 fused to exons 5 to 8 of the murine mutant
Q10b, which lacks a functional transmembrane domain and is therefore
secreted. This plasmid was a generous gift from D. Margulies of the NIH
[24]. Plasmid phβ2m was constructed to express the human
β2-microglobulin. It is based on the cDNA of human β2m (hβ2m) isolated
from PC3 cells and amplified by PCR using the following primers: the 5'
sense primer, 5'-AGATTCCCAAGCTTATGTCTCGCTCCGTGG-3' (SEQ ID
NO:40), contained a restriction site for HindIII before the signal peptide, and
the 3' antisense primer, 5'-AGCTAGTCTAGATTATCACATGTCTCGATCCCACTTAAC-3'
(SEQ ID NO:41), contained the restriction site for XbaI at the 3' end of
β2m. The purified PCR product was cut with HindIII and XbaI and ligated
into the eukaryotic expression vector pcDNA3.1 (Invitrogen). Plasmids
sHLA-A2 and sHLA-B7 contain the cDNA of the first 4 exons of these
alleles ligated into the plasmid pcDNA3.1 [34].
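
As a quick consistency check on the two primers given above, the following Python sketch looks for the recognition sequences of HindIII (AAGCTT) and XbaI (TCTAGA) in each primer. The helper name `find_sites` is hypothetical; the primer sequences are copied from SEQ ID NO:40 and SEQ ID NO:41.

```python
# Recognition sequences of the two restriction enzymes named in the text.
SITES = {"HindIII": "AAGCTT", "XbaI": "TCTAGA"}

SENSE = "AGATTCCCAAGCTTATGTCTCGCTCCGTGG"                # SEQ ID NO:40
ANTISENSE = "AGCTAGTCTAGATTATCACATGTCTCGATCCCACTTAAC"   # SEQ ID NO:41


def find_sites(primer: str) -> dict:
    """Map each enzyme whose site occurs in the primer to its 0-based offset."""
    return {name: primer.find(site)
            for name, site in SITES.items() if site in primer}


print("sense primer:", find_sites(SENSE))
print("antisense primer:", find_sites(ANTISENSE))

# The start codon follows the HindIII site directly in the sense primer.
hind_end = SENSE.find(SITES["HindIII"]) + len(SITES["HindIII"])
print("codon after HindIII site:", SENSE[hind_end:hind_end + 3])
```

Each primer carries exactly one of the two sites, which is what allows the PCR product to be cloned directionally between HindIII and XbaI.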
Antibodies and hybridomas: 

The hybridomas W6/32 and BB7.2, producing an anti-MHC class-I and an
anti-HLA-A2 antibody respectively, and HB-149, producing an anti-β2m
antibody, were obtained from the ATCC. The antibodies were affinity purified
from mouse ascites fluid using protein A-Sepharose CL-4B (Sigma).

Transfection of cancer cells and selection of clones secreting sMHC: 
Cell lines were either co-transfected with plasmid HLA-A2/Q10b and with
phβ2m, which conferred resistance to the antibiotic G418, or transfected only
with the plasmids sHLA-A2 and sHLA-B7, which themselves carried the
antibiotic resistance. Cells were electroporated by use of a Gene Pulser
(Bio-Rad) set at 280-300 mV, 960 µF. Transfected cell clones were selected in
G418 antibiotic and screened for those secreting sMHC into the growth medium.
Secretion of sMHC was assayed by sandwich ELISA with plates coated with the
mAb BB7.2 (for sMHC-A2) or HB-149 (for sMHC-B7), and sMHC was detected
with the biotinylated mAb W6/32. Color was developed with ABTS (Sigma)
catalyzed by streptavidin peroxidase (Sigma).
Affinity purification of soluble MHC:

Cultured cells expressing the soluble MHC were grown to confluency
in 150 mm plates. The culture medium was collected and residual cells were
removed by centrifugation. Soluble MHC molecules were purified from the
cleared culture medium by affinity chromatography on W6/32 antibody
columns at 4 °C. The antibodies were coupled to NHS-activated agarose
(Pharmacia) or to protein A Sepharose (Sigma) with dimethyl pimelimidate
(Sigma). The columns were washed with 0.5 M NaCl, 20 mM Tris pH 8. The
MHC molecules were eluted from the column with 0.1 M acetic acid at pH 3.
Peptides were separated from the MHC complexes by boiling for five minutes
in 10 % acetic acid followed by ultrafiltration through a 3 kDa Microcon
(Amicon) [2].

Synthetic peptides: 

Peptides were synthesized on an AbiMed AMS 422 multiple peptide
synthesizer (Abimed, Langenfeld, Germany) by Fmoc chemistry, precipitated
with ether and used with or without further purification (HPLC).
Peptide separation and analysis:

The MHC bound peptides were resolved by reverse-phase HPLC on
0.1 ID fused silica capillaries of about 30 cm length (J&W), slurry packed
with POROS 10 R2 (PerSeptive Biosystems). The capillaries were fitted with
an electrospray needle made from 36-gauge stainless tubing (Small Parts Inc.,
Miami Lakes, FL). A Rheodyne 9125 HPLC injector fitted with a 20 µl loop
was used for loading the column. The peptides were resolved by a relatively
long (90 minutes) linear gradient of 5 to 50 % acetonitrile with 0.1 % acetic
acid, at a flow rate of about 1 µl/minute. The flow was electrosprayed directly
from the HPLC column into an ion trap mass spectrometer (LCQ, Finnigan).
The mass spectrometry analysis was done in the positive ion mode, repeatedly
using a full MS scan, usually between 450 and 1500 atomic mass units
(amu), followed by collision-induced decomposition (CID) of the dominant ion
selected from the previous MS scan. In some cases the full MS was performed
with a narrower mass range to reduce the number of detected peptides. The
peptides were identified by comparing their MS and CID data to the calculated
MS and CID of the proteins in the Genpept databank
(www.ncbi.nlm.nih.gov/genpept) using the Sequest software [25] (obtained
from Finnigan, San Jose, CA). The number of times each peptide was
fragmented by CID was usually limited to two by dynamic exclusion, a feature
of the Xcalibur control software of the LCQ mass spectrometer (Finnigan).
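
To relate a peptide sequence to the 450 to 1500 scan window mentioned above, the m/z of a protonated peptide can be calculated from standard monoisotopic residue masses. The Python sketch below is an illustration only and is not the calculation performed by Sequest; the function names `mz` and `in_scan_range` are hypothetical.

```python
# Standard monoisotopic residue masses (Da) of the 20 common amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # one water is added for the free N- and C-termini
PROTON = 1.00728   # mass of the charge-carrying proton


def mz(sequence: str, charge: int) -> float:
    """m/z of the peptide carrying `charge` protons."""
    neutral = sum(MONO[aa] for aa in sequence) + WATER
    return (neutral + charge * PROTON) / charge


def in_scan_range(sequence: str, charge: int,
                  low: float = 450.0, high: float = 1500.0) -> bool:
    """True if the ion falls inside the full-MS scan window used in the text."""
    return low <= mz(sequence, charge) <= high


for peptide in ("LLLDVPTAAV", "GVYDGEEHSV"):
    print(peptide, round(mz(peptide, 1), 2), round(mz(peptide, 2), 2))
```

For LLLDVPTAAV and GVYDGEEHSV the singly charged values come out close to the masses listed for these peptides in Table 8 (1011.5 and 1091.4), and both the singly and the doubly charged ions fall inside the scan window.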
Stabilization of cell surface HLA-A2.1 by peptide binding:
RMA-S-HHD cells were washed three times with PBS followed by
incubation overnight in FCS-free IMDM medium at 26°C. Synthetic peptides
were added to 10^6 cells at a concentration of 100 µM. The cells were
incubated for two hours at 26°C followed by two hours at 37°C. The
stabilization of the HHD MHC by peptide binding was measured by
FACS analysis on a Becton Dickinson FACStar flow cytometer after decorating
the cells with W6/32 mAb at 4°C for one hour, followed by a 30 min incubation
with anti-mouse FITC (Sigma) at 4°C.
Cytotoxic T lymphocyte assays:

Transgenic mice expressing a single chain HLA-A2.1/Db-β2m, which
are double knockout for H-2Db and for β2m (HHD mice) [23], were
immunized four times intraperitoneally at 7-day intervals with 2 x 10^
irradiated (5,000 rad) RMA-S-HHD-B7.1 cells loaded for two hours at 26 °C
followed by two hours at 37 °C with 100 µM of the synthetic peptides. Ten
days after the last immunization the spleens were removed from the vaccinated
mice. Splenocytes were re-stimulated with 100 µM of synthetic peptides for
five days. Viable lymphocytes were separated by Lympholyte-M (Cedarlane,
Hornby, Canada) and resuspended in RPMI-HEPES. Cytotoxic activity was
measured as in [26] by admixing the lymphocytes at different ratios with 5 x
10^3 EL4-HHD cells grown in medium containing 35S-methionine and then
loaded with the synthetic peptides.




EXPERIMENTAL RESULTS 

In order to identify a large number of MHC bound peptide antigens
presented in the context of a particular MHC haplotype, different human cell
lines were transfected with expression vectors for soluble, secreted MHC
molecules. Indeed, different soluble MHC could be transfected into various
cell lines, enabling the recovery of large amounts of the soluble
MHC molecules from the cells' growth medium. The sMHC molecules were
recovered with their authentic patterns of peptides still bound and free of
contamination by cellular debris and detergents. Prostate (PC3), ovarian
(UCI-107) and breast (MDA-231 and MCF-7) cell lines were transfected with
the DNA coding for HLA-A2.1/Q10b or for sMHC-A2 and sMHC-B7. Soluble
MHC molecules were recovered from the culture medium without disrupting
the cells and the sMHC molecules were purified by a single step of
immunoaffinity chromatography. About 200 ng of the sMHC molecules were
recovered from about 10^9 cells (Figures 1A-C). The MHC large subunit, the
β2m and small amounts of antibodies that were released from the
immunoaffinity column by the acid treatment were the only proteins detected
in the column eluant. The peptides were separated from the protein subunits
of the MHC by ultrafiltration. The recovered heavy subunit of the soluble
MHC molecules was confirmed to be that of HLA-A2.1 by peptide mapping
and by microsequencing.
Sequencing of a large number of individual peptides was approached by
electrospray tandem mass spectrometry. The peptides were partially resolved
on homemade nano-capillary reversed-phase columns interfaced directly to an
electrospray mass spectrometer. The peptide mixtures were resolved by
relatively long reversed-phase HPLC gradients on long capillary columns,
which allowed mass measurement and fragmentation of a large number
of peptides. The mass spectra were recorded between 450 and 1500 mass units,
which is the expected mass (m/z) range of the singly and the doubly charged
MHC bound peptides. The mass spectrometry data included the
total-ion-current chromatogram (TIC, Figure 2A) and the mass spectrum of the
peptides at each time point (Figure 2B). The mass spectrometer was
programmed to repeatedly select the most abundant peptide observed in each
spectrum and to fragment it by CID (Figure 2C). Peptides were identified by
comparing their masses and the masses of their fragments to those calculated
for peptides derived from all the human proteins in the databank. The
computer programs were instructed to search for putative peptides resulting
from non-specific proteolysis since the specificity of the proteases responsible
for generating the MHC bound peptides in cells is not well defined.
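
Searching without protease specificity means that every contiguous stretch of a protein is a candidate. The Python sketch below illustrates only the precursor-mass part of such a search and is not the Sequest algorithm, which additionally scores the CID fragment masses. The toy protein string, the 8 to 13 residue length window and the 0.5 mass-unit tolerance are assumptions chosen for the example.

```python
# Standard monoisotopic residue masses (Da).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER, PROTON = 18.01056, 1.00728


def precursor_matches(protein, observed_mh, tol=0.5, min_len=8, max_len=13):
    """List (start, end, peptide, [M+H]+) for every subsequence, cut at any
    position, whose singly protonated mass is within `tol` of `observed_mh`.
    Positions are 1-based and inclusive."""
    hits = []
    for start in range(len(protein)):
        for length in range(min_len, max_len + 1):
            end = start + length
            if end > len(protein):
                break
            peptide = protein[start:end]
            mh = sum(MONO[aa] for aa in peptide) + WATER + PROTON
            if abs(mh - observed_mh) <= tol:
                hits.append((start + 1, end, peptide, round(mh, 2)))
    return hits


# Hypothetical toy sequence that merely embeds the Table 8 peptide
# LLLDVPTAAV; it is not a real protein entry.
TOY_PROTEIN = "MSRGALLLDVPTAAVQAGK"
for hit in precursor_matches(TOY_PROTEIN, 1011.5):
    print(hit)
```

Because leucine and isoleucine have the same mass, a precursor match alone cannot tell them apart, and several unrelated subsequences can share a mass within the tolerance; this is one reason the fragment spectra and the comparison with synthetic peptides described in the text matter.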

The relatively high sensitivity of the capillary ESI-MS/MS analysis and
the large amounts of peptides recovered from the cells by use of the soluble
MHC made it possible to perform multiple capillary HPLC separations with each
peptide preparation. Peptides recovered from soluble MHC produced by about
5x10? cells were used for each capillary chromatography. Multiple
chromatography runs made it possible to detect those peptides that were
observed reproducibly and to combine their CID data to improve the
signal-to-noise ratio of the CID spectra. The combined and improved data sets
were used for databank searches and peptide identifications. Using relatively
long capillary columns (of above 30 cm) and long reversed-phase gradients
facilitated achieving high resolving power. Most peptides normally elute over
15 to 30 seconds, which was sufficient time for the mass spectrometer to
analyze up to three different co-eluting peptides. The mass spectrometer was
programmed not to fragment any peptide more than twice in order to increase
the total number of peptides analyzed during each chromatography.
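
The effect of limiting each peptide to two fragmentation events can be shown with a small simulation. The Python sketch below is a simplified model and not the instrument control software's implementation: it ignores the time window that real dynamic exclusion uses, bins precursors to the nearest mass unit, and the intensities are invented for the example.

```python
from collections import Counter


def pick_precursors(scans, max_repeats=2, bin_width=1.0):
    """For each full-MS scan, pick the most intense ion that has not yet
    been fragmented `max_repeats` times. Each scan is a list of
    (m/z, intensity) pairs; None is returned for a scan with no eligible ion."""
    fragmented = Counter()
    picks = []
    for scan in scans:
        chosen = None
        for mz, _intensity in sorted(scan, key=lambda p: p[1], reverse=True):
            key = round(mz / bin_width)   # crude m/z binning (assumption)
            if fragmented[key] < max_repeats:
                fragmented[key] += 1
                chosen = mz
                break
        picks.append(chosen)
    return picks


# Three co-eluting ions seen in five consecutive scans (made-up intensities).
scan = [(913.4, 100.0), (947.4, 60.0), (1091.4, 10.0)]
print(pick_precursors([scan] * 5))
```

Without the limit, the most abundant ion would be selected in every scan; with it, the weaker co-eluting ions are reached within the 15 to 30 second elution window.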

A total of about three thousand different peptides were sufficiently
resolved and fragmented during the different chromatography runs of the
mixtures eluted from the sHLA-A2 and sHLA-B7 recovered from the different
cell lines. The large majority of the observed peptides were common to all the
different cancer lines and only a small fraction was detected in only one of the
cancer types. From this large number of detected peptides, about 200 were
identified at high certainty to be derived from known proteins and the rest
were not identified. Table 8 is a typical list of such peptides recovered from
the soluble MHCs and identified by the computer analysis. Among these
peptides, fourteen were already known as MHC bound peptides. Those desired
peptides that originate from putative tumor antigens were chemically
synthesized to further evaluate the accuracy of their amino acid sequences and
to enable their study as MHC bound peptides and of their significance as
cancer antigens. The accuracy of their amino acid sequences was ascertained by
running a chromatography of the synthetic peptides, under exactly the same
conditions, immediately after the natural peptide mixtures and comparing the
chromatography retention times, the exact masses and the CID spectra of the
synthetic and natural peptides (Figures 3A-B). When a synthetic peptide
behaved identically to the natural peptide by these three criteria, this served
as a clear indication that the identification was indeed correct. Twenty-seven
of the most interesting peptides were chemically synthesized and confirmed to
be correct by this assay, an example of which is displayed in Figures 3A-B.

Table 8

List of peptides recovered from sMHC of different cancer cells and identified
at high certainty by mass spectrometry

Peptides from soluble HLA-A2

No.  Mass (m/z)   Sequence (SEQ ID NO:)  Protein                                      Position(1)  Score(2)     Score(3)     Synthetic(4)  Ref.(5)
1    898.4        LLDVPTAAV (1)          γ-IFN inducible protein (IP-30)              17-25        159.9        28                         [41]
2    1011.5       LLLDVPTAAV (2)         γ-IFN inducible protein (IP-30)              16-25        1793.7       31                         [41]
3    1210.4       LLLDVPTAAVQA (3)       γ-IFN inducible protein (IP-30)              16-27        [illegible]  21
4    800.5        GLLGTLVQ (4)           Beta-catenin                                 400-407      0.2          17
5    913.4        GLLGTLVQL (5)          Beta-catenin                                 400-408      181.7        31
6    922.3        ALFGALFLA (6)          Phospholipid transfer protein                2-10         [illegible]  23           +
7    945.4        SLLGGDVVSV (7)         TSC-22-like protein                          22-32        591.9        34
8    947.4        NLTISDVSV (8)          MUC1                                         130-138      69.6         23           +             [26]
9    958.3        SLWGQPAEA (9)          Human collagen type IV                       18-25        41.2         23           +
10   981.7        SLIGHLQTL (10)         Protein tyrosine phosphatase                 336-344      49.1         32
11   989.5        SLSEKTVLL (11)         CD59                                         106-114      87.6         29           +
12   989.4        [illegible] (12)       Flightless I homolog                         [illegible]  257.3        30           +
13   1028.5       GLIEKNIEL (13)         DNA methyl transferase (MTDM)                425-433      87.6         28
14   1031.4       GLYPGLIWL (14)         Interferon regulatory factor-6               21-29        [illegible]  30                         [illegible]
15   1038.5       YLLPAIVHI (15)         RNA helicase                                 146-154      408.4        30
16   1068.4       ALSDHHIYL (16)         Fructose bisphosphate aldolase               216-224      481.7        23                         [illegible]
17   1071.5       ILDQKINEV (17)         Ornithine decarboxylase                      23-31        108.8        30                         [96]
18   1071.6       ILDKKVEKV (18)         Human HSP 90 beta, HSP 84                    570-578      53.3         29                         [74]
19   1080.4       SLLPPTALVGL (19)       H. Transporter SEC23A                        [illegible]  181.8        33
20   1091.4       GVYDGEEHSV (20)        MAGE-B2                                      231-240      79.9         20
21   1094.5       SLLPPDALVGL (21)       H. Transporter SEC23B                        150-160      181.8        33
22   1121.3       TLWVDPYEV (22)         B-cell translocation gene (BTG1)             103-111      577.3        24           +             [illegible]
23   1145.4       FLFDGSPTYV (23)        Fatty acid synthase (FAS)                    292-301      [illegible]  23
24   1258.5       FLFDGSPTYVL (24)       Fatty acid synthase (FAS)                    292-302      611.2        27
25   1360.4       ALWDIETGQQTV (25)      Guanine nucleotide-binding                   167-178      [illegible]  28

Peptides from soluble HLA-B7

No.  Mass (m/z)   Sequence (SEQ ID NO:)  Protein                                      Position(1)  Score(2)     Score(3)     Synthetic(4)  Ref.(5)
1    854.3        VPSEPGGVL (26)         70 kDa SHP-1L                                422-30       120          27
2    883.4        SPTQPIQL (27)          Cell membrane glycoprotein, 110000 Mr        257-61       80           20
3    895.4        SPALPGLKL (28)         Transmembrane activator and CAML interactor  147-55       120          27           +
4    899.5        APRTVALTA (29)         HLA-SB beta                                  9-17         60           24                         [75]
5    [illegible]  SPKLPVSSL (30)         DNA binding protein homolog                  372-80       120          25
6    989.3        KPSLPFTSL (31)         Translation initiation factor                79-87        120          [illegible]  +
7    999.5        LVMAPRTVL (32)         MHC class-I                                  2-10         135          18                         [75]
8    1050.4       KPAFFAEKL (33)         Annexin A1                                   274-82       80           [illegible]
9    1075.4       SPYQNIKIL (34)         Spermidine aminopropyltransferase            128-36       80           20
10   1104.5       AASKERSGVSL (35)       Histone H1                                   50-60        36           [illegible]                [illegible]
11   1114.3       [illegible] (36)       precursor                                    2-12         240          22           +
12   1194.5       APSGSLAVPLAVL (37)     Hypothetical protein                         9-21         360          31

Table 8: An example of MHC bound peptides that were identified by the
Sequest software [25] (obtained from Finnigan, San Jose, CA) after mass
spectrometer analysis. (1) Position of the first and the last amino acid of the
peptide. (2) Calculated score, estimating half the time for dissociation of the
peptide-MHC complex [42]. (3) Calculated score. (4) Sequence approved by
analyzing in comparison a synthetic peptide. (5) Peptide is known.

Among the many peptides derived from different housekeeping proteins
and enzymes, some peptides were determined to be derived from known tumor
associated antigens. These include mucin (MUC-1) and MAGE-B2, while
others were derived from proteins whose level is known to be significantly
elevated in cancer cells, such as beta-catenin, DNA methyl transferase and fatty
acid synthase (Table 8).

A comparison of the patterns of peptides presented by the same MHC in
cell lines of different tissue origin enabled the identification of those peptides
presented uniquely in cells of a particular tissue origin. Only a few of the
peptides were determined to be unique to specific cell lines, while most of the
peptides that were observed in all the different cell lines were derived from
normal cellular proteins. Also, significantly different patterns of peptides were
recovered from sHLA-A2 and from sHLA-B7. Examples of unique peptides,
which are displayed in Table 9, include peptide p922 (phospholipid transfer
protein) recovered only from PC3 cells and peptide p947 (SEQ ID NO:8)
(MUC1) recovered only from MCF-7 grown without estrogen. Peptide p945
(SEQ ID NO:7, derived from TSC-22-like protein) is a novel peptide that was
detected at high level in the two breast cancer cell lines (MCF-7 and MDA-231),
but was not observed in the ovarian (UCI-107) and the prostate (PC3) cancer
cells. Peptide p981 (SEQ ID NO:10) originated from protein tyrosine
phosphatase, and was detected only in the breast cancer cell line MDA-231. One
of the most interesting novel peptides identified was p1091 (SEQ ID NO:20),
derived from the tumor antigen MAGE-B2. The peptide was detected only in
the ovarian cancer cells (UCI-107) and not in the other cell lines. The
elution patterns and CID spectra of the synthetic and natural peptides were
identical (Figures 4A and 4B). The binding affinity of this peptide to the
MHC molecules was determined to be normal as assayed by reconstitution and
stabilization of empty MHC on the surface of RMA-S-HHD cells (Figure 4C).
This peptide is derived from the same region in the MAGE proteins as are
other previously identified MHC bound peptides derived from MAGE-A4 and
from MAGE-A10 [27, 28] (Figure 4D).
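
The comparison across cell lines amounts to asking, for each peptide, whether all the lines presenting it share one tissue origin. The Python sketch below encodes the observations stated in the paragraph above, plus peptide p1038, which Table 9 marks as present in all six HLA-A2 columns. The dictionary layout and the function name `tissue_restricted` are assumptions made for this example.

```python
# Cell lines in which each peptide was reported, as stated in the text
# (p1038 is read from Table 9, where it is marked in every column).
OBSERVED = {
    "p922": {"PC3"},
    "p947": {"MCF-7 (no estrogen)"},
    "p945": {"MCF-7", "MDA-231"},
    "p981": {"MDA-231"},
    "p1091": {"UCI-107"},
    "p1038": {"MCF-7", "MCF-7 (no estrogen)", "MDA-231",
              "PC3", "UCI-107", "UCI-101"},
}

TISSUE = {
    "PC3": "prostate",
    "MCF-7": "breast",
    "MCF-7 (no estrogen)": "breast",
    "MDA-231": "breast",
    "UCI-107": "ovarian",
    "UCI-101": "ovarian",
}


def tissue_restricted(observed, tissue_of):
    """Map each peptide seen in cell lines of a single tissue to that tissue."""
    restricted = {}
    for peptide, lines in observed.items():
        tissues = {tissue_of[line] for line in lines}
        if len(tissues) == 1:
            restricted[peptide] = next(iter(tissues))
    return restricted


for peptide, tissue in tissue_restricted(OBSERVED, TISSUE).items():
    print(peptide, "->", tissue)
```

Peptides presented across tissues, such as p1038 here, drop out of the result, which mirrors the observation that most widely shared peptides derive from normal cellular proteins.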
Table 9

Comparison of MHC peptide patterns between cell lines of different cancer
origin









Sequence 


MCF-7 


1 




FC-3 




UCJ- 




(m/i) 






without 


231 




107 


101 


I 


898.4 








+ 




4" 








1 1 1 .PiVPTA AW? 1 * 




■4* 










3 


1210 5 






+ 






+ 


+ 


4 


800 $ 












+ 




5 


913.4 










+ 






6 


9223 


A_LFGALFLA(6) 
















942.4 


SLLtj(*iDVVSVf71 














g 


947.4 


NLTT$DV$V{8) 














0 


95 8,3 


Si WGOPAFA^ 










+ 


+ 


10 


981.7 


SLIGHLQTL(IO) 






+ 








1 1 


989.5 


<l SRKTV1 f /111 












•1" 


















+ 


Id 






*b 


•1* 










14 


1031 4 


OLYPGLIWU14) 






+ 






+ 


15 


1038.5 


YLLPA1VH10S) 


+ 


+ 


+ 


+ 


+ 


+ 


16 


1068 4 


ALSDHHJYL(16) 


+ 












17 


1071.5 


ILDQKINEV(17) 




+ 


+ 




+ 




18 


1071.6 


ILD(CKVHKV(18) 




+ 






+ 




19 


1080 4 


SLLPPTALVGL(19) 






+ 




+ 


+ 


20 


1091,4 


GVYDGRP.HSV(2d) 














21 


1094.4 


SLLPPDALVGL(21) 


+ 


+ 


+ 




+ 


+ 




1121,3 


TLWVOPYBV(J2) 






+ 


+ 


+ 


+ 


23 


1145.4 


FLFDGSPTYV(23) 


+ 












2-1 


1258.5 


aFDG$PTYVL(24) 






+ 




+ 


+ 


23 


1360.4 


ALWD3ETGQQTV(2S) 






+ 




+ 





5 





Mass 


Sequence 


cm 




UC1-10 






(SEOIDNO:) 




31 


7 


1 


854.3 


VPSEPGGVK26) 








2 


883.4 


SPTQP1QM27) 




+ 




3 


895 4 


SPALPGLKL(28) 








4 


899.4 


APftTVALTA(29) 


+ 






5 


999 5 


SPKLPVSSL(30) 




+ 


+ 


6 


927,3 


fcPSLPFT$l{3!) 






+ 


7 


9893 


LVMAPRTVL(32) 










1050.4 


KPAFFAEKL(33) 










1075.4 


$PYQN1KJt(34) 








10 


1104 5 


AAS1CERSGVSL(35) 


+ 






11 


1114,3 


APFePUSOTL(36) 






+ 


12 


1194.5 


APSGSLAVPLAVU37) 




+ 





Table 9: (A) The appearance of peptides from soluble HLA-A2 in breast
cancer cells MCF-7 and MDA-231, MCF-7 grown without estrogen (column
marked 1), prostate cancer cells PC-3 and the ovarian cancer cells UCI-107
and UCI-101. (B) The appearance of peptides from soluble HLA-B7 in B cell
leukemia cancer cells C1R, breast cancer cells MDA-231 and ovarian cancer
cells UCI-107.

As another approach to ascertain that the identified peptides were indeed
MHC bound peptide antigens, their capacity to bind tightly and stabilize cell
surface HLA-A2.1 was tested by reconstitution into empty MHC on the
surface of RMA-S-HHD cells. Binding was assayed by FACS analysis after
decorating the cells with the fluorescent anti-intact MHC mAb W6/32 (Figure
5). Nine of the synthetic peptides were determined to stabilize cell surface
MHC significantly more than without the added peptides and to a similar
extent as peptide (G9-209-2M) IMDQVPFSV (SEQ ID NO:42), derived from
the melanoma protein gp-100 [29].

To further evaluate the affinity of the peptides to HLA-A2 and to
obtain some insight into their immunogenic potential, selected peptides were
tested for their ability to induce an immune response in HLA-A2 transgenic
mice. It was assumed that only peptides that could be effectively presented
and remain tightly bound to the cells would be capable of inducing an immune
response in these mice. The same synthetic peptides that were used for the
FACS analysis were used for immunization of the HHD transgenic mice,
which express the human HLA-A2.1/Db-β2m single chain. To immunize the
mice, the HHD culture cells were loaded with the different peptides and then
injected into the HHD mice. The immune response in the mice was followed by
the appearance of CTLs specific for these peptides. The lysis patterns of the
target HHD cells by T-cells taken from the immunized mice are shown in
Figure 6. Some of the peptides were indeed capable of inducing an immune
response, which both authenticates them as MHC bound peptides and gives an
indication of their immunogenic potential. The CTL results demonstrate
significant lysis of EL4-HHD cells loaded with the peptides p1028 (SEQ ID
NO:13) from DNA methyl transferase, p1258 (SEQ ID NO:24) from fatty acid
synthase, p1121 (SEQ ID NO:22) from B cell translocation gene (BTG) and
p1068 (SEQ ID NO:16) from aldolase, as compared to the negative control
peptide ALLCAPSLL (SEQ ID NO:43).

SUMMARY OF PEPTIDE INFORMATION FOR SOLUBLE HLA-A2 

10 The following provides a summary of peptide information so far 

collected for eptides bound to soluble HLA-A2 using the method of the 
present invention. 

The following notations are used herein:

G: group number until May 7, 2001
Mg: mass of the natural peptide
Mp: mass of the identified peptide
Tg: observed retention time of the natural peptide
Tp: calculated retention time of the identified peptide
S: calculated internal score
A2: adherence to HLA-A2 consensus motif
B7: adherence to HLA-B7 consensus motif
P: identified peptide sequence
PR: protein from which sequence is derived
POS: location of peptide in protein
genpept: link to protein information in GenBank
ref: previously known peptide

Cell lines:

#D: PC3+A2/Q10
#E: MCF7+A2/Q10
#F: MDA-231+A2/Q10
#EST: MCF7 (with estrogen) +A2/Q10
#FR: MCF7 (without estrogen) +A2/Q10
#G: UCI-107+A2/Q10
#H: C1R+SB7
#I: UCI-107+SB7
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#J: MDA-231+SB7
#K: UCI-101+SA2 

#L: 2780 (ovarian cancer cell line)+sA2

#S: synthetic peptides
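
As an illustration of the notation above, the following is a minimal Python sketch of how one record of the listing might be read into named fields. The regular expressions, the PeptideRecord class and the parse_record function are hypothetical helpers and not part of the described method. The sketch assumes a cleanly transcribed record, treats the number after "+-" as a spread of the observed retention time, and stores the two numbers that follow each cell line code as a plain pair, since their meaning is not defined in the notation list.

```python
import re
from dataclasses import dataclass, field

# Hypothetical patterns for the assumed layout of a cleanly transcribed record.
HEAD_RE = re.compile(r"G=(\d+):\s*Mg=\s*([\d.]+):\s*Tg=(\d+)\+-(\d+)")
CELL_RE = re.compile(r"#([A-Z]+)\+\((\d+),(\d+)\)")
SCORE_RE = re.compile(
    r"S=(\d+)\((\d+),(\d+)\)\s*Mp=\s*([\d.]+)\(\s*(-?[\d.]+)\)\s*"
    r"Tp=\s*(\d+)\s*A2=\s*(\S+)\s*P=([A-Z]+)"
)
TAIL_RE = re.compile(r"POS=(\d+)\s*\(SEQ ID NO:\s*(\d+)\)")


@dataclass
class PeptideRecord:
    group: int            # G
    mg: float             # Mg, mass of the natural peptide
    tg: int               # Tg, observed retention time
    tg_spread: int        # number after "+-" (assumed to be a spread)
    cells: dict = field(default_factory=dict)  # cell line code -> pair of counts
    score: tuple = ()     # S and the two bracketed sub-values
    mp: float = 0.0       # Mp, mass of the identified peptide
    delta: float = 0.0    # bracketed value after Mp
    tp: int = 0           # Tp, calculated retention time
    a2: str = ""          # A2 field, kept as text (it may read ">1k/31")
    peptide: str = ""     # P
    pos: int = 0          # POS
    seq_id: int = 0       # SEQ ID NO


def parse_record(text: str) -> PeptideRecord:
    """Parse one record; raise ValueError if it does not fit the assumed layout."""
    head = HEAD_RE.search(text)
    body = SCORE_RE.search(text)
    tail = TAIL_RE.search(text)
    if not (head and body and tail):
        raise ValueError("record does not match the assumed layout")
    rec = PeptideRecord(
        group=int(head[1]), mg=float(head[2]),
        tg=int(head[3]), tg_spread=int(head[4]),
    )
    for code, first, second in CELL_RE.findall(text):
        rec.cells[code] = (int(first), int(second))
    rec.score = (int(body[1]), int(body[2]), int(body[3]))
    rec.mp, rec.delta = float(body[4]), float(body[5])
    rec.tp, rec.a2, rec.peptide = int(body[6]), body[7], body[8]
    rec.pos, rec.seq_id = int(tail[1]), int(tail[2])
    return rec


# A cleanly transcribed copy of the G=1234 record from the listing.
SAMPLE = (
    "G=1234: Mg= 810.3: Tg=31+-1 #D+(2,2) #E+(7,4) #F+(4,3) #EST+(7,2)\n"
    "#FR+(4,2) #G+(33,11) #K+(3,2) #L+(2,1)\n"
    "S=84(87,79) Mp= 810.2(-0.1) Tp= 34 A2= 11/29 P=ALAPGLPTA\n"
    "genpept PR=>gi|5771535|gb|AAD51419.1|AF173937_1 (AF173937) secreted\n"
    "protein of unknown function [Homo sapiens] POS=21 (SEQ ID NO: 44)\n"
)

if __name__ == "__main__":
    print(parse_record(SAMPLE))
```

In this sample the bracketed value after Mp equals Mp minus Mg rounded to one decimal place, which suggests, without the text stating it, that the bracket reports the mass difference between the identified and the natural peptide.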

G=1990: Mg= 800.4: Tg=38+-0 #S+(2,1) #G+(10,7)
S=83(87,74) Mp= 800.5 Tp= 53 A2=0.02/18 P=GLLGTLVQ
genpept PR=>gi|860588|emb|CAA61107.1| (X87838) beta-catenin
[Homo sapiens] POS=399 (SEQ ID NO: 4)

G=1234: Mg= 810.3: Tg=31+-1 #D+(2,2) #E+(7,4) #F+(4,3) #EST+(7,2)
#FR+(4,2) #G+(33,11) #K+(3,2) #L+(2,1)
S=84(87,79) Mp= 810.2(-0.1) Tp= 34 A2= 11/29 P=ALAPGLPTA
genpept PR=>gi|5771535|gb|AAD51419.1|AF173937_1 (AF173937) secreted
protein of unknown function [Homo sapiens] POS=21 (SEQ ID NO: 44)

G=1251: Mg= 811.4: Tg=35+-0 #G+(S,3)
S=96(96,99) Mp= 811.5( 0.1) Tp= 36 A2=465/26 P=KLLEPVL
genpept PR=>gi|338447|gb|AAA60583.1| (M60854) RPS16 [Homo sapiens]
POS=50 (SEQ ID NO:45)

G=1378: Mg= 841.3: Tg=41+-0 #G+(9,5)
S=77(83,66) Mp= 841.4( 0.1) Tp= 40 A2=0.0/16 P=SLLPAIVE
genpept PR=>gi|189428|gb|AAA36399.1| (J02902) phosphatase 2A
regulatory subunit [Homo sapiens] POS=403 (SEQ ID NO: 46)

G=1419: Mg= 848.3: Tg=34+-1 #E+(2,1) #F+(2,1) #G+(5,5) #K+(1,1)
S=84(83,89) Mp= 848.4( 0.1) Tp= 34 A2= 52/26 P=SVLGSLSSV
genpept PR=>gi|5833114|gb|AAD53401.1|AF107840_1 (AF107840) nuclear
pore-associated protein [Homo sapiens] POS=280 (SEQ ID NO: 47)

G=1420: Mg= 848.4: Tg=37+-1 #D+(2,2) #E+(11,6) #F+(8,5) #EST+(3,2)
#FR+(4,2) #G+(25,11)
S=95(94,99) Mp= 848.4( 0.0) Tp= 39 A2=118/28 P=LLGPPPVGV
genpept PR=>gi|10436195|dbj|BAB14750.1| (AK023978) unnamed protein
product [Homo sapiens] POS=159 (SEQ ID NO:48)

G=1439: Mg= 852.3: Tg=22+-2 #FR+(2,1) #I+(32,6)
S=63(81,33) Mp= 852.0(-0.3) Tp= 25 A2=0.0/1 P-]?Grri?Fri?l?
genpept PR=>gi|5689367|dbj|BAA82967.1| (AB021227) membrane-type-5
matrix metalloproteinase [Homo sapiens] POS=11 (SEQ ID NO: 49)

G=1492: Mg= 860.3: Tg=37+-1 #G+(9,5)
S=81(87,69) Mp= 860.3( 0.0) Tp= 31 A2=116/28 P=SWSGPLIGV
genpept PR=>gi|1469189|dbj|BAA09482.1| (D50923) The KIAA0133 gene
product is novel. [Homo sapiens] POS=629 (SEQ ID NO: 50)

G=1510: Mg= 862.5: Tg=29+-2 #F+(4,4) #G+(4,2)
S=78(87,59) Mp= 862.2(-0.3) Tp= 27 A2=116/32 P=SMAPGLTSV
genpept PR=>gi|12484559|gb|AAF20366.2|AF150754_1 (AF150754)
3'-phosphoadenosine 5'-phosphosulfate synthase 2b isoform [Homo
sapiens] POS=542 (SEQ ID NO:51)

G=1540: Mg= 868.4: Tg=43+-0 #E+(2,1) #F+(7,4) #FR+(1,1) #G+(4,4)
#K+(3,3)
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S=83(89,69) Mp= 868.4( 0.0) Tp= 46 A2= 19/30 P=LLIPGLATA
genpept PR=>gi|2274974|emb|CAA57489.1| (X81900) NADH oxidoreductase
subunit MWFE [Homo sapiens] POS=16 (SEQ ID NO: 52)

G=1563: Mg= 871.3: Tg=36+-1 #D+(1,1) #E+(8,6) #F+(4,3) #EST+(3,2)
#FR+(5,2) #G+(15,9)
S=86(89,79) Mp= 871.4( 0.1) Tp= 33 A2=592/33 P=GLLGNVAEV
genpept PR=>gi|12655181|gb|AAH01447.1|AAH01447 (BC001447) Similar to
ZYG homolog [Homo sapiens] POS=10 (SEQ ID NO: 53)



G=157S: Mg= 872.4: Tg=32+-3 #D+(2,1) #E+(3,3) #F+(24,7) #EST+(3,2)
#FR+(4,2) #G+(12,7)
S=79(76,88) Mp= 872.5( 0.1) Tp= 33 A2= 11/26 P=SLIKLVEA
genpept PR=>gi|7020538|dbj|BAA91170.1| (AK000444) unnamed protein
product [Homo sapiens] POS=277 (SEQ ID NO: 54)

G=1606: Mg= 876.4: Tg=28+-2 #E+(2,1) #F+(2,1) #EST+(1,1) #FR+(4,2)
S=84(82,89) Mp= 876.3(-0.1) Tp= 32 A2=201/31 P=GLAESVSTL
genpept PR=>gi|12652733|gb|AAH00116.1|AAH00116 (BC000116) Similar to
KIAA0174 gene product [Homo sapiens] POS=95 (SEQ ID NO: 55)

G=1621: Mg= 878.3: Tg=40+-1 #E+(6,4) #F+(4,2) #EST+(1,1) #FR+(3,2)
#G+(11,8) #K+(1,1)
S=88(92,79) Mp= 878.4( 0.1) Tp= 40 A2= 55/30 P=ATIGGTFTV
genpept PR=>gi|6330243|dbj|BAA86495.1| (AB033007) KIAA1181 protein
[Homo sapiens] POS=304 (SEQ ID NO: 56)

G=1637: Mg= 880.3: Tg=37+-0 #H+(5,3)
S=88(89,88) Mp= 880.4( 0.1) Tp= 40 A2= 2/23 P=IITGPAPVL
genpept PR=>gi|7542357|gb|AAF63417.1|AF142422_1 (AF142422) QUAKING
isoform 3 [Homo sapiens] POS=250 (SEQ ID NO: 57)

G=1655: Mg= 882.3: Tg=43+-1 #F+(1,1) #G+(6,4) #K+(1,1)
S=87(87,88) Mp= 882.3( 0.0) Tp= 43 A2=0.0/18 P=SFDGWATV
genpept PR=>gi|7263944|emb|CAB81773.1| (AJ276359) mucin 4 [Homo
sapiens] POS=1560 (SEQ ID NO: 58)

G=1732: Mg= 894.4: Tg=42+-0 #G+(6,3)
S=79(82,72) Mp= 894.4( 0.0) Tp= 41 A2= 2/20 P=LPPDALVGL
genpept PR=>gi|1296666|emb|CAA65775.1| (X97065) Sec23 protein [Homo
sapiens] POS=158 (SEQ ID NO: 59)

G=1737: Mg= 895.3: Tg=14+-1 #G+(4,2)
S=76(79,72) Mp= 895.4( 0.1) Tp= 22 A2= 47/25 P=ILDAGGHNV
genpept PR=>gi|1808578|dbj|BAA07918.1| (D44466) proteasome subunit
p112 [Homo sapiens] POS=736 (SEQ ID NO: 60)

G=1744: Mg= 896.3: Tg=28+-3 #D+(6,3) #E+(14,5) #F+(20,6)
#EST+(5,2) #FR+(6,3) #G+(47,11) #K+(8,4)
S=92(98,81) Mp= 896.4( 0.1) Tp= 30 A2=512/30 P=GLYSGVTTV
genpept PR=>gi|36065|emb|CAA42118.1| (X59543) M1 subunit of
ribonucleotide reductase [Homo sapiens] POS=46 (SEQ ID NO:61)

G=1745: Mg= 896.3: Tg=55+-0 #G+(8,4)
S=79(76,86) Mp= 896.5( 0.2) Tp= 60 A2=>1k/24 P=FLYPFPL
genpept PR=>gi|436224|dbj|BAA05062.1| (D26067) KIAA0033 [Homo
sapiens] POS=185 (SEQ ID NO: 62)
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G=1768: Mg= 898.4: Tg=3S+-1 #D+(11,3) #E+(50,10) #F+(13,7)
#EST+(7,2) #FR+(13,3) #G+(63,11) #K+(15,6) #L+(9,3)
S=81(78,89) Mp= 898.4( 0.0) Tp= 36 A2= 47/28 P=LLDVPTAAV
genpept PR=>gi|6165618|gb|AAF04618.1|AF097362_1 (AF097362)
gamma-interferon inducible lysosomal thiol reductase [Homo sapiens]
POS=27 (SEQ ID NO:1) ref

G=1770: Mg= 898.4: Tg=38+-1 #E+(4,2) #F+(3,2) #FR+(1,1) #G+(18,11)
#K+(2,1)
S=88(92,79) Mp= 898.3(-0.1) Tp= 41 A2= 79/29 P=ALLFSSPTL
genpept PR=>gi|1737205|gb|AAB38876.1| (U75276) TFIIB related factor
hBRF [Homo sapiens] POS=609 (SEQ ID NO: 63)

G=1786: Mg= 899.5: Tg=26+-2 #F+(2,2) #EST+(4,2) #FR+(5,2)
#G+(32,8) #K+(8,5) #L+(1,1)
S=93(96,89) Mp= 899.5( 0.0) Tp= 27 A2=243/25 P=KLGSVPVTV
genpept PR=>gi|12653653|gb|AAH00609.1|AAH00609 (BC000609) KIAA0738
gene product [Homo sapiens] POS=623 (SEQ ID NO: 64)

G=1795: Mg= 900.4: Tg=53+-0 #D+(2,2) #E+(7,5) #F+(11,6) #EST+(2,1)
#FR+(6,3) #G+(21,11) #K+(16,6)
S=81(86,72) Mp= 900.5( 0.1) Tp= 55 A2=182/33 P=ALFPGVALL
genpept PR=>gi|2245365|gb|AAC5153-8.1| (U75885) ER-60 protein [Homo
sapiens] POS=7 (SEQ ID NO: 65)

G=1802: Mg= 901.3: Tg=33+-2 #E+(3,2) #G+(2,1)
S=85(92,69) Mp= 901.5( 0.2) Tp= 32 A2=160/29 P=GLVGSLQEV
genpept PR=>gi|11967711|emb|CAC19484.1| (AJ278357) Tsg24 protein
[Homo sapiens] POS=56 (SEQ ID NO: 66)

G=1804: Mg= 901.4: Tg=31+-2 #E+(1,1) #FR+(1,1) #G+(6,3)
S=90(95,79) Mp= 901.3(-0.1) Tp= 22 A2= 2/18 P=APLSDTAQV
genpept PR=>gi|10438789|dbj|BAB15344.1| (AK026063) unnamed protein
product [Homo sapiens] POS=197 (SEQ ID NO: 67)

G=1804: Mg= 901.4: Tg=31+-2 #E+(1,1) #FR+(1,1) #G+(6,3)
S=89(94,79) Mp= 901.5( 0.1) Tp= 36 A2=160/33 P=SLASI»LAKV
genpept PR=>gi|8489831|gb|AAF75772.1|AF265555_1 (AF265555)
ubiquitin-conjugating BIR-domain enzyme APOLLON [Homo sapiens]
POS=123C (SEQ ID NO: 68)

G=1822: Mg= 903.3: Tg=16+-7 #D+(7,3) #E+(14,4) #F+(14,4)
#EST+(18,2) #FR+(13,3) #G+(116,10) #K+(20,6)
S=92(98,79) Mp= 903.4( 0.1) Tp= 23 A2=160/29 P=GIATDVQTV
genpept PR=>gi|565647|dbj|BAA05645.1| (D26598) proteasome subunit
HsC10-II [Homo sapiens] POS=55 (SEQ ID NO: 69)

G=1860: Mg= 907.5: Tg=39+-1 #D+(12,4) #E+(1,1) #F+(9,5) #EST+(3,1)
#FR+(4,2) #G+(10,6) #£+(4,3)
S=88(91,81) Mp= 907.5( 0.0) Tp= 37 A2= 79/31 P=SLFGGSVKL
genpept PR=>gi|13375569|gb|AAK20398.1|AF349951_1 (AF349951) HP95
[Homo sapiens] POS=103 (SEQ ID NO: 70)

G=1861: Mg= 907.6: Tg=38+-1 #EST+(2,1) #FR+(1,1)
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S=87(87,89) Mp= 907.6( 0.0) Tp= 32 A2= 21/19 P=KVGPVPVLV
genpept PR=>gi|12304623|gb|AAH01734.1|AAH01734 (BC001734) protein
translocation complex beta [Homo sapiens] POS=67 (SEQ ID NO: 71)

G=1899: Mg= 910.3: Tg=46+-1 #E+(8,5) #F+(21,7) #EST+(4,2)
#FR+(7,3) #G+(24,11) #K+(13,6) #L+(3,2)
S=72(65,89) Mp= 910.4( 0.1) Tp= 40 A2=182/32 P=GLLPDVPSL
genpept PR=>gi|13623421|gb|AAH06309.1|AAH06309 (BC006309) Similar to
RIKEN cDNA 5730589L02 gene [Homo sapiens] POS=141 (SEQ ID NO: 72)



G=1901: Mg= 910.4: Tg=39+-0 #D+(2,1) #E+(5,2) #F+(9,5) #EST+(1,1)
#FR+(1,1)
S=80(90,59) Mp= 910.4( 0.0) Tp= 41 A2=160/30 P=ALPPVLTTV
genpept PR=>gi|3882183|dbj|BAA34451.1| (AB018274) KIAA0731 protein
[Homo sapiens] POS=131 (SEQ ID NO: 73)

G=1904: Mg= 910.4: Tg=38+-1 #E+(2,1) #F+(3,2) #EST+(1,1) #FR+(3,2)
#G+(3,3) #K+(2,1)
S=90(95,79) Mp= 910.5( 0.1) Tp= 32 A2= 52/24 P=GVLFNIQAV
genpept PR=>gi|7264004|emb|CAB81656.1| (AL049822) dJ160A22.4
(histone H2A) [Homo sapiens] POS=107 (SEQ ID NO: 74)

G=1922: Mg= 912.5: Tg=42+-1 #E+(5,4) #F+(2,1) #FR+(1,1) #G+(3,2)
#K+(1,1)
S=78(83,69) Mp= 912.5( 0.0) Tp= 43 A2= 49/31 P=ALTPVVVTL
genpept PR=>gi|13177739|gb|AAH03644.1|AAH03644 (BC003644)
cyclin-dependent kinase 4 [Homo sapiens] POS=170 (SEQ ID NO: 75)

G=1931: Mg= 913.4: Tg=34+-1 #E+(9,4) #F+(2,2) #EST+(3,2) #FR+(1,1)
S=84(96,59) Mp= 913.3(-0.1) Tp= 29 A2= 70/27 P=ALNPADXTV
genpept PR=>gi|6634421|emb|CAB64373.1| (AJ238375) putative protein
TH1 [Homo sapiens] POS=103 (SEQ ID NO: 76)

G=1933: Mg= 913.4: Tg=49+-0 #S+(12,2) #D+(17,5) #E+(16,8)
#F+(18,7) #EST+(2,1) #FR+(4,2) #G+(22,11) #H+(1,1) #K+(10,6)
S=93(95,89) Mp= 913.6( 0.2) Tp= 46 A2=182/31 P=GLLGTLVQL
genpept PR=>gi|38520|emb|CAA79497.1| (Z19054) beta catenin [Homo
sapiens] POS=400 (SEQ ID NO:5)
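
The Mp values of the listing can be checked approximately against the peptide sequences. The text does not state how Mp was calculated; the short Python sketch below assumes that it is close to the monoisotopic mass of the singly protonated peptide. The residue masses are standard monoisotopic values, and the function name mh_plus is hypothetical. Under that assumption the sequence GLLGTLVQL of the record above gives a value within 0.1 of the listed Mp of 913.6.

```python
# Standard monoisotopic residue masses (Da) of the 20 common amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056   # mass of H2O added to the sum of residue masses
PROTON = 1.00728   # mass of the proton in the singly charged ion


def mh_plus(peptide: str) -> float:
    """Monoisotopic mass of the singly protonated peptide (assumed model for Mp)."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER + PROTON


if __name__ == "__main__":
    for seq in ("GLLGTLVQL", "KLLEPVLLL"):
        print(seq, round(mh_plus(seq), 1))
```

Some other records of the listing differ from this simple model by a few tenths of a mass unit, so the sketch is suitable only as a rough plausibility check of a transcribed sequence, not as a reconstruction of the calculation actually used.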

G=1939: Mg= 914.4: Tg=40+-0 #G+(4,3)
S=82(79,89) Mp= 914.4( 0.0) Tp- \1 A2=0.0/16 P=DASGLALLL
genpept PR=>gi|1060907|dbj|BAA11242.1| (D78177) quinolinate
phosphoribosyl transferase [Homo sapiens] POS=2 (SEQ ID NO: 77)

G=1942: Mg= 914.5: Tg=16+-3 #F+(4,1) #G+(6,3)
S=90(95,79) Mp= 914.4(-0.1) Tp= 27 A2=160/29 P=SLTGHISTV
genpept PR=>gi|2832296|gb|AAD09407.1| (AF044333) pleiotropic
regulator 1 [Homo sapiens] POS=241 (SEQ ID NO: 78)

G=1948: Mg= 915.5: Tg=38+-0 #D+(2,1) #F+(11,7)
S=91(97,77) Mp= 915.6( 0.1) Tp= 42 A2=0.5/15 P=VHVLTFTV
genpept PR=>gi|3242214|emb|CAA07243.1| (AJ006778) DRIM protein [Homo
sapiens] POS=1898 (SEQ ID NO:79)

G=1974: Mg= 918.3: Tg=36+-1 #F+(3,2) #G+(19,10)
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S=84(88,77) Mp= 918.6( 0.3) Tp= 34 A2= 6/25 P=SLKYVPLV
genpept PR=>gi|10436278|dbj|BAB14783.1| (AK024024) unnamed protein
product [Homo sapiens] POS=248 (SEQ ID NO: 70)

G=1979: Mg= 918.6: Tg=53+-0 #E+(5,3) #F+(6,3) #EST+(1,1) #FR+(4,2)
#G+(4,2) #K+(4,3)
S=81(84,74) Mp= 918.5(-0.1) Tp= 54 A2=0.8/16 P=LPYWGVAL
genpept PR=>gi|7023639|dbj|BAA92035.1| (AK002014) unnamed protein
product [Homo sapiens] POS=272 (SEQ ID NO: 71)

G=1988: Mg= 920.3: Tg=32+-1 #E+(2,2) #F+(S,2) #FR+(1,1) #G+(35,11)
#K+(1,1)
S=89(90,89) Mp= 920.3( 0.0) Tp= 27 A2= 31/24 P=SIYPSPTGV
genpept PR=>gi|3661610|gb|AAC61776.1| (AF092565) splicing factor
Prp8 [Homo sapiens] POS=1693 (SEQ ID NO: 72)

G=2008: Mg= 922.3: Tg=58+-1 #S+(9,1) #D+(17,5) #L+(8,2)
S=79(80,77) Mp= 922.5( 0.3) Tp= 59 A2=245/22 P=ALFGALFLA
genpept PR=>gi|2653432|dbj|BAA23647.1| (AB005297) BAI 1 [Homo
sapiens] POS=1163 (SEQ ID NO: 6)

G=2023: Mg= 924.2: Tg=15+-14 #F+(6,3) #FR+(1,1)
S=83(86,79) Mp= 924.4( 0.2) Tp= 33 A2= 11/24 P=ALASHLIEA
genpept PR=>gi|7212807|gb|AAF40470.1|AF181263_1 (AF181263) EH domain
containing 2 [Homo sapiens] POS=507 (SEQ ID NO: 73)

G=2027: Mg= 924.5: Tg=13+-1 #G+(3,2)
S=83(85,79) Mp= 924.4(-0.1) Tp= 20 A2= 75/24 P=KLGPAPKTL
genpept PR=>gi|408198|gb|AAB27691.1| (S64671) DNA-binding
protein/plasminogen activator inhibitor-1 regulator [human, HeLa S3,
Peptide Partial, 176 aa] [Homo sapiens] POS=133 (SEQ ID NO: 74)

G=2029: Mg= 924.5: Tg=43+-1 #F+(1,1) #G+(17,8)
S=93(91,99) Mp= 924.6( 0.1) Tp= 44 A2=>1k/27 P=KLLEPVLL
genpept PR=>gi|338447|gb|AAA60583.1| (M60854) RPS16 [Homo sapiens]
POS=50 (SEQ ID NO: 75)

G=2050: Mg= 926.5: Tg=14+-3 #F+(8,2) #EST+(1,1) #FR+(3,2) #G+(6,3)
S=90(96,79) Mp= 926.4(-0.1) Tp= 29 A2= 78/30 P=ALSGHLETV
genpept PR=>gi|12314197|emb|CAB99342.1| (AL13900S) bA255A11.8 (novel
protein similar to annexin A2 (ANXA2) (lipocortin II, calpactin I
heavy chain, chromobindin 8, PAP-IV)) [Homo sapiens] POS=90 (SEQ ID
NO:76)

G=2068: Mg= 929.5: Tg=43+-1 #E+(2,2) #F+(20,7) #FR+(1,1)
#G+(36,11) #K+(24,6) #L+(3,2)
S=92(90,99) Mp= 929.5( 0.0) Tp= 31 A2=173/25 P=SLLDKIIGA
genpept PR=>gi|11034809|gb|AAG27093.1|AF312393_1 (AF312393)
leucine-zipper protein FKSG13 [Homo sapiens] POS=56 (SEQ ID NO:77)

G=2071: Mg= 930.3: Tg=35+-1 #F+(4,3) #G+(13,7) #K+(3,2)
S=91(97,77) Mp= 930.4( 0.1) Tp= 33 A2=257/33 P=GLLGAGGTVSV
genpept PR=>gi|11493522|gb|AAG35534.1|AF130117_68 (AF130109) PR01512
[Homo sapiens] POS=17 (SEQ ID NO: 76)

G=2072: Mg= 930.4: Tg=53+-0 #D+(8,4) #E+(21,8) #F+(10,5)
#EST+(7,2) #FR+(6,3) #G+(11,7) #K+(7,4)
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S=78(83,69) Mp= 930.6( 0.2) Tp= 53 A2=608/32 P=GLVPFLVSV
genpept PR=>gi|13543657|gb|AAH05978.1|AAH05978 (BC005978)
karyopherin alpha 2 (RAG cohort 1, importin alpha 1) [Homo sapiens]
POS=377 (SEQ ID NO:79) ref

G=2095: Mg= 932.5: Tg=46+-0 #F+(13,7) #G+(1,1) #K+(2,1)
S=72(74,69) Mp= 932.5( 0.0) Tp= 46 A2= 54/27 P=ILGLGYPSL
genpept PR=>gi|7339520|emb|CAB82850.1| (AJ250717) procathepsin E
[Homo sapiens] POS=184 (SEQ ID NO: 80)



G=2126: Mg= 936.3: Tg=39+-1 #E+(1,1) #F+(7,5) #G+(1,1)
S=81(86,72) Mp= 936.4( 0.1) Tp= 40 A2=213/26 P=ALLAGSEYL
genpept PR=>gi|12653123|gb|AAH00328.1|AAH00328 (BC000328) eukaryotic
translation initiation factor 3, subunit 7 (zeta, 66/67kD) [Homo
sapiens] POS=439 (SEQ ID NO: 81)

G=2146: Mg= 938.3: Tg=38+-0 #FR+(2,1)
S=77(72,90) Mp= 938.5( 0.2) Tp= 34 A2=656/33 P=SLAELVHAV
genpept PR=>gi|4092963|gb|AAD04812.1| (AF033122) non-p53 regulated
PA26-T1 nuclear protein [Homo sapiens] POS=254 (SEQ ID NO: 82)

G=2160: Mg= 940.4: Tg=58+-1 #E+(6,4) #F+(8,3) #G+(1,1) #K+(3,2)
S=82(80,88) Mp= 940.6( 0.2) Tp= 53 A2= 8/15 P=MQPILLLL
genpept PR=>gi|161159|gb|AAB59528.1| (J03072) serine protease B
[Homo sapiens] POS=1 (SEQ ID NO: 83)

G=2176: Mg= 942.1: Tg=48+-0 #G+(5,3)
S=74(74,77) Mp= 942.5( 0.4) Tp= 48 A2= 2/12 P=GLFAPQFY
genpept PR=>gi|2062371|gb|AAB65850.1| (U70730) SnoN2 [Homo sapiens]
POS=274 (SEQ ID NO: 84)

G=2208: Mg= 944.5: Tg=59+-2 #S+(2,1) #L+(4,2)
S=86(94,69) Mp= 944.5( 0.0) Tp= 48 A2=577/25 P=ALWGQGTLV
genpept PR=>gi|773628|gb|AAA88873.1| (U21267) immunoglobulin mu
heavy chain [Homo sapiens] POS=103 (SEQ ID NO: 85)

G=2213: Mg= 945.4: Tg=37+-1 #S+(12,1) #E+(60,10) #F+(6,5)
#EST+(5,2) #FR+(11,3) #K+(1,1)
S=87(97,66) Mp= 945.5( 0.1) Tp= 34 A2=592/34 P=SLLCGDVVSV
genpept PR=>gi|5231131|gb|AAD41085.1|AF153603_1 (AF153603) TSC-22
related protein [Homo sapiens] POS=27 (SEQ ID NO:7)



G=2231: Mg= 947.3: Tg=41+-1 #E+(4,2) #FR+(1,1) #G+(11,6)
S=85(88,81) Mp= 947.3( 0.0) Tp= 20 A2=0.0/18 P=DTETAWWV
genpept PR=>gi|4883681|gb|AAD31596.1|AF057352_1 (AF057352)
hepatocellular carcinoma autoantigen [Homo sapiens] POS=117 (SEQ ID
NO: 86)

G=2233: Mg= 947.4: Tg=34+-1 #S+(9,1) #EST+(3,2) #FR+(4,2)
S=90(90,90) Mp= 947.4( 0.0) Tp= 30 A2= 70/23 P=NLTISDVSV
genpept PR=>gi|541680|emb|CAA56734.1| (X80761) MUC1 [Homo sapiens]
POS=133 (SEQ ID NO: 8) ref

G=2241: Mg= 948.3: Tg=58+-1 #E+(6,3) #F+(4,3) #EST+(1,1) #FR+(3,2)
#K+(5,3)




S=75(74,79) Mp= 948.5( 0.2) Tp= 62 A2=203/21 P=ALLPIFFGA
genpept PR=>gi|13185197|emb|CAC33282.1| (AX083359) unnamed protein
product [Homo sapiens] POS=43 (SEQ ID NO: 87)

G=2270: Mg= 951.6: Tg=40+-1 #D+(11,4) #E+(49,10) #F+(2,1)
#EST+(3,1) #FR+(7,3) #G+(11,6) #K+(4,2) #L+(4,2)
S=88(93,79) Mp= 951.5(-0.1) Tp= 33 A2=191/22 P=AMVIFKSGV
genpept PR=>gi|3929525|gb|AAC82612.1| (AF034611) intrinsic
factor-B12 receptor precursor; cubilin [Homo sapiens] POS=3371 (SEQ
ID NO:86)

G=2299: Mg= 954.4: Tg=50+-0 #D+(1,1) #E+(31,9) #F+(15,7)
#EST+(5,2) #FR+(8,3) #G+(28,11) #K+(16,6) #L+(4,2)
S=81(89,63) Mp= 954.5( 0.1) Tp= 49 A2=162/34 P=SLLPAIVEL
genpept PR=>gi|3603418|gb|AAC63525.1| (AF083439) protein phosphatase
2A regulatory subunit A, beta isoform [Homo sapiens] POS=415 (SEQ
ID NO:89) ref

G=2329: Mg= 956.6: Tg=33+-2 #EST+(3,2) #FR+(5,3)
S=81(83,79) Mp= 956.5(-0.1) Tp= 32 A2=736/32 P=YLGPHTASV
genpept PR=>gi|12052942|emb|CAB66646.1| (AL136711) hypothetical
protein [Homo sapiens] POS=137 (SEQ ID NO:89)

G=2344: Mg= 958.3: Tg=33+-2 #S+(6,2) #D+(8,2) #E+(100,8)
#G+(34,11) #K+(4,2)
S=96(96,99) Mp= 958.3( 0.0) Tp= 38 A2= 31/23 P=SLWGQPAEA
genpept PR=>gi|463430|gb|AAC27816.1| (U04520) type IV collagen alpha
5 chain [Homo sapiens] POS=18 (SEQ ID NO: 9)

G=2350: Mg= 959.3: Tg=46+-1 #G+(14,7) #K+(8,5)
S=79(90,54) Mp= 959.6( 0.3) Tp= 45 A2= 16/27 P=SLFPGQVVI
genpept PR=>gi|12654999|gb|AAH01347.1|AAH01347 (BC001347) polymerase
(DNA-directed), alpha (70kD) [Homo sapiens] POS=295 (SEQ ID NO:90)

G=2355: Mg= 959.5: Tg=38+-0 #F+(3,2)
S=82(80,89) Mp= 959.5( 0.0) Tp= 38 A2=324/29 P=SLLEKSLGL
genpept PR=>gi|13529002|gb|AAH05291.1|AAH05291 (BC005291) eukaryotic
translation elongation factor 1 epsilon 1 [Homo sapiens] POS=8 (SEQ
ID NO: 91)

G=2356: Mg= 959.5: Tg=30+-1 #D+(3,1) #E+(29,9) #F+(2,2) #EST+(7,2)
#FR+(9,3) #G+(49,10) #K+(6,2)
S=85(84,90) Mp= 959.5( 0.0) Tp= 27 A2=485/24 P=ILTDITKGV
genpept PR=>gi|181969|gb|AAA50388.1| (M19997) elongation factor 2
[Homo sapiens] POS=161 (SEQ ID NO: 92)

G=2372: Mg= 960.5: Tg=35+-1 #D+(2,1) #G+(5,5)
S=78(73,90) Mp= 960.5( 0.0) Tp= 34 A2= 79/24 P=GLFQGKTPL
genpept PR=>gi|4S89929|dbj|BAA76931.1| (AB024704) fls353 [Homo
sapiens] POS=53 (SEQ ID NO: 93)

G=2382: Mg= 962.3: Tg=47+-0 #G+($,4)
S=80(82,77) Mp= 962.5( 0.2) Tp= 11 A2=0.0/6 P=ESQLKKMV
genpept PR=>gi|12803337|gb|AAH02487.1|AAH02487 (BC002487) tumor
susceptibility gene 101 [Homo sapiens] POS=5 (SEQ ID NO: 94)

G=2434: Mg= 967.3: Tg=54+-0 #G+(10,7)
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S=83(85,79) Mp= 967.5( 0.2) Tp= 61 A2=139/19 P=FLYPFPLA
genpept PR=>gi|436224|dbj|BAA05062.1| (D26067) KIAA0033 [Homo
sapiens] POS=185 (SEQ ID NO: 95)

G=2446: Mg= 968.4: Tg=20+-2 #F+(4,2) #G+(19,6)
S=86(89,79) Mp= 968.4( 0.0) Tp= 29 A2= 78/29 P=ALTGHLEEV
genpept PR=>gi|34388|emb|CAA29338.1| (X05908) lipocortin (AA 1-346)
[Homo sapiens] POS=99 (SEQ ID NO: 96)

G=2447: Mg= 969.4: Tg=42+-1 #D+(1,1) #E+(1,1) #F+(11,7) #FR+(1,1)
#G+(27,11) #K+(6,3) #L+(3,2)
S=87(87,90) Mp= 968.4( 0.0) Tp= 36 A2=>1k/33 P=SLLDPVFEV
genpept PR=>gi|1504020|dbj|BAA13209.1| (D86973) similar to Yeast
translation activator GCN1 (P1;A48126) [Homo sapiens] POS=1406 (SEQ
ID NO:97)

G=2464: Mg= 969.5: Tg=47+-0 #E+(2,2) #F+(7,5) #G+(25,11)
S=83(85,81) Mp= 969.5( 0.0) Tp= 48 A2= 1/19 P=MAPQALLLL
genpept PR=>gi|1780998|emb|CAA71531.1| (Y10520) HLA-C alpha chain
(Cw*1101) [Homo sapiens] POS=1 (SEQ ID NO: 96)

G=2489: Mg= 971.5: Tg=42+-0 #D+(9,4) #F+(10,6)
S=83(91,81) Mp= 971.4(-0.1) Tp= 42 A2= 1/23 P=FSNGYLASL
genpept PR=>gi|12655065|gb|AAH01382.1|AAH01382 (BC001382) solute
carrier family 29 (nucleoside transporters), member 1 [Homo sapiens]
POS=105 (SEQ ID NO: 99)

G=2495: Mg= 972.4: Tg=52+-1 #D+(25,5) #E+(19,9) #F+(24,7)
#EST+(5,2) #FR+(8,3) #G+(43,11) #K+(32,6) #L+(4,2)
S=91(96,81) Mp= 972.5( 0.1) Tp= 41 A2=656/30 P=TLIEDILGV
genpept PR=>gi|11121497|emb|CAC14946.1| (AL132825) dJ756N5.2 (novel
protein (DKFZp727M231) similar to Trp4-associated protein TAP1
(ABCB2)) [Homo sapiens] POS=209 (SEQ ID NO:100)

G=2514: Mg= 973.4: Tg=33+-1 #F+(5,3) #G+(10,6) #K+(4,2) #L+(1,1)
S=82(84,79) Mp= 973.4( 0.0) Tp= 31 A2=0.0/17 P=IAEAVRTTL
genpept PR=>gi|2559010|gb|AAC96011.1| (AF026292) chaperonin
containing t-complex polypeptide 1, eta subunit; CCT-eta [Homo
sapiens] POS=32 (SEQ ID NO: 101)






G=2515: Mg= 973.5: Tg=34+-1 #EST+(5,2)
S=80(80,81) Mp= 973.4(-0.1) Tp= 31 A2=307/27 P=KLSFLEAAL
genpept PR=>gi|12314174|emb|CAC03001.1| (AL137067) bA13B9.3 (novel
protein similar to KRT8) [Homo sapiens] POS=368 (SEQ ID NO: 102)



G=2522: Mg= 974.3: Tg=30+-2 #S+(7,1) #fi+(3,3) #EST+(8,2)
#FR+(11,3)
S=89(90,89) Mp= 974.5( 0.2) Tp= 25 A2= 6/21 P=SLSVKLEQA
genpept PR=>gi|37258|emb|CAA44819.1| (X63105) Xpr [Homo sapiens]
POS=153 (SEQ ID NO:104)

G=2527: Mg= 974.3: Tg=50+-0 #D+(1,1) #E+(15,7) #F+(9,4) #G+(22,10)
#K+(10,5)
S=90(98,72) Mp= 974.5( 0.2) Tp= 50 A2=413/26 P=MLLAALMIV
genpept PR=>gi|5802822|gb|AAD51798.1|AF164614_2 (AF164614) envelope
protein [Homo sapiens] POS=76 (SEQ ID NO: 105)
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G=2537: Mg= 974.5: Tg=53+-0 #F+(4,3) #G+(13,9) #K+(2,2)
S=81(83,79) Mp= 974.5( 0.0) Tp= 56 A2= 60/24 P=AXLPTSXFL
genpept PR=>gi|2323410|gb|AAB66581.1| (AF015913) Skb1Hs [Homo
sapiens] POS=229 (SEQ ID NO: 106)

G=2546: Mg= 975.4: Tg=38+-1 #E+(3,2) #F+(7,4) #G+(19,10) #K+(1,1)
S=82(91,63) Mp= 975.4( 0.0) Tp= 32 A2= 8/27 P=AALPNVYEV
genpept PR=>gi|12652781|gb|AAH00142.1|AAH00142 (BC000142)
minichromosome maintenance deficient (S. cerevisiae) 5 (cell
division cycle 46) [Homo sapiens] POS<=026 (SEQ ID NO: 107)

G=2567: Mg= 977.5: Tg=22+-3 #G+(9,5)
S=84(82,90) Mp= 977.4(-0.1) Tp= 24 A2=186/24 P=RMLPHAPGV
genpept PR=>gi|1667394|gb|AAC50814.1| (U31814) transcriptional
regulator homolog RPD3 [Homo sapiens] POS=372 (SEQ ID NO: 108)

G=2610: Mg= 981.7: Tg=36+-0 #S #F+(3,2)
S=79(80,79) Mp= 981.6(-0.1) Tp= 38 A2= 49/32 P=SLIGHLQTL
genpept PR=>gi|642013|gb|AAB06261.1| (U16996) protein tyrosine
phosphatase [Homo sapiens] POS=337 (SEQ ID NO: 10)

G=2636: Mg= 984.5: Tg=61+-1 #D+(S,4) #E+(9,5) #F+(12,6) #FR+(5,2)
#G+(2,1) #K+(12,5) #L+(1,1)
S=85(91,72) Mp= 984.7( 0.2) Tp= 61 A2= 11/21 P=LMVLVALIL
genpept PR=>gi|12654925|gb|AAH01309.1|AAH01309 (BC001309) Unknown
(protein for MGC:5508) [Homo sapiens] POS=19 (SEQ ID NO: 109)

G=2641: Mg= 984.7: Tg=36+-0 #EST+(1,1) #FR+(2,1)
S=78(77,81) Mp= 983.5(-1.2) Tp= 35 A2=140/28 P=KILPTLEAV
genpept PR=>gi|12653227|gb|AAH00382.1|AAH00382 (BC000382)
interleukin enhancer binding factor 2, 45kD [Homo sapiens] POS=127
(SEQ ID NO: 110)

G=2649: Mg= 985.5: Tg=40+-1 #E+(5,3) #FR+(3,2) #G+(4,3)
S=84(93,63) Mp= 985.6( 0.1) Tp= 38 A2=>1k/33 P=ALLDRIVSV
genpept PR=>gi|1504030|dbj|BAA13214.1| (D86978) similar to a
C.elegans protein encoded in cosmid K12D12 (Z49069) [Homo sapiens]
POS=14$9 (SEQ ID NO:111)

G=2661: Mg= 986.6: Tg=35+-1 #E+(3,2) #F+(3,2) #EST+(3,2) #FR+(1,1)
f^{%\\m Mp= 986.7( 0.1) Tp= 35 A2=160/26 P«TW«WGV
genpept PR=>gi|3540219|dbj|BAA32662.1| (D87686) KIAA0017 protein
[Homo sapiens] POS=165 (SEQ ID NO:112)

G=2666: Mg= 987.4: Tg=32+-2 #D+(1,1) #E+(1,1) #F+(5,2) »f f+ t"' 7 ) „
S=77(87,54) Mp= 987.5( 0.1) Tp= 33 A2=131/26 P=YLPPATQVV
genpept PR=>gi|13325116|gb|AAH04386.1|AAH04386 (BC004386) KIAA0111
gene product [Homo sapiens] POS=207 (SEQ ID NO: 113)






G=2668: Mg= 987.4: Tg=14+-13 #F+(4,3)
VlT^A) Mp= 987.3(-0.1) Tp= 26 A2=0.0/15 P=PMEALAEQV
genpept PR=>gi|3882297|dbj|BAA34508.1| (AB018331) KIAA0789 protein
[Homo sapiens] POS=569 (SEQ ID NO: 114)

G=2671: Mg= 987.6: Tg=29+-1 #F+(4,3) #EST+(1,1) #FR+(3,2)
#G+(11,5)






S=74(83,54) Mp= 987.5(-0.1) Tp= 33 A2=656/30 P=RLSEAIVTV
genpept PR=>gi|7106848|gb|AAF36149.1|AF151063_1 (AF151063) HSPC229
[Homo sapiens] POS=137 (SEQ ID NO: 115)

G=2677: Mg= 988.3: Tg=13+-4 #E+(2,1) #F+(7,1) #EST+(4,1) #G+(21,6)
S=88(99,63) Mp= 988.4( 0.1) Tp= 20 A2= 28/27 P=SLDQPTQTV
genpept PR=>gi|1719197|gb|AAD03462.1| (U46025) translation initiation
factor eIF-3 p110 subunit [Homo sapiens] POS=834 (SEQ ID NO: 116)

G=2692: Mg= 989.4: Tg=41+-1 #S+(8,2) #D+(13,5) #E+(12,6) #F+(11,6)
#EST+(4,2) #FR+(6,3) #G+(15,9) #K+(13,6)
S=79(83,72) Mp= 989.5( 0.1) Tp= 39 A2=257/30 P=SLFPGKLEV
genpept PR=>gi|440177|gb|AAC03563.1| (UGU84) flightless-I homolog
[Homo sapiens] POS=1009 (SEQ ID NO: 12)



G=2693: Mg= 989.5: Tg=31+-2 #S+(15,2) #E+(7,3) #F+(13,5)
#EST+(3,1) #G+(12,7) #K+(6,4)
S=83(84,81) Mp= 989.5( 0.0) Tp= 35 A2= 38/29 P=SLSEKTVLL
genpept PR=>gi|160151|gb|AAA88793.1| (M84349) CD59 protein [Homo
sapiens] POS=106 (SEQ ID NO: 11)

G=2729: Mg= 993.5: Tg=18+-4 #F+(2,1) #EST+(4,2) #FR+(8,3) #G+(9,4)
S=92(97,81) Mp= 993.6( 0.1) Tp= 22 A2=243/23 P=KLHGVNINV
genpept PR=>gi|12653083|gb|AAH00307.1|AAH00307 (BC000307) RNA
binding motif protein 4 [Homo sapiens] POS=59 (SEQ ID NO: 117)

G=2769: Mg= 999.5: Tg=35+-1 #H+(5,3) #I+(8,4) #J+(5,4)
S=82(83,81) Mp= 999.5( 0.0) Tp= 39 A2= 5/16 P=LVMAPRTVL
genpept PR=>gi|9738916|gb|AAF97S47.1| (AF129293) MHC class I antigen
[Homo sapiens] POS=2 (SEQ ID NO: 118)

G=2773: Mg= 999.6: TgHS+-1 #D+(2,1) #E+(15,6) #F+(12,7)
#EST+(3,1) #FR+(6,3) #G+(15,8) #K+(11,5) #L+(1,1)
S=80(86,69) Mp= 999.6( 0.0) Tp= 42 A2= 22/31 P=SIIGRLLEV
genpept PR=>gi|190516|gb|AAA36508.1| (M63960) protein phosphatase-1
[Homo sapiens] POS=11 (SEQ ID NO: 119)



#K+(1,1)

(SEQ ID NO:120)

Tg=26+-2 #F+(3,2) #EST+(2,2) #FR+(2,1)

ID NO:121)

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G=2822: Mg=1004.2: Tg=27+-1 #G+(6,5)
S=90(91,90) Mp=1004.4( 0.2) Tp= 30 A2= 88/25 P=TLSEVTNQL
genpept PR=>gi|12053045|emb|CAB66698.1| (AL136764) hypothetical
protein [Homo sapiens] POS=484 (SEQ ID NO: 123)

G=2829: Mg=1004.5: Tg=38+-1 #F+(3,2) #EST+(1,1) #FR+(2,1) #G+(9,6)
S=91(92,90) Mp=1004.6( 0.1) Tp= 37 A2=324/29 P=ALFEGKVQL
genpept PR=>gi|10439712|dbj|BAB15S50.1| (AK026780) unnamed protein
product [Homo sapiens] POS=442 (SEQ ID NO: 124)

G=2833: Mg=1004.6: Tg=29+-0 #EST+(3,1)
S=87(87,89) Mp=1004.6( 0.0) Tp= 31 A2= 32/28 P=GLKGRVFEV
genpept PR=>gi|854179|emb|CAA60827.1| (X87373) ribosomal protein S3a
[Homo sapiens] POS=61 (SEQ ID NO: 125)

G=2835: Mg=1005.2: Tg=48+-0 #G+(3,3)
S=84(83,89) Mp=1005.5( 0.3) Tp= 42 A2= 35/25 P=NIFP*PVGV
genpept PR=>gi|2822460|gb|AAC39565.1| (AF030234) splicing factor
Sip1 [Homo sapiens] POS=912 (SEQ ID NO: 126)

G=2872: Mg=1009.6: Tg=47+-1 #E+(2,1) #EST+(5,2) #FR+(6,3)
#K+(15,6)
S=87(96,66) Mp=1009.7( 0.1) Tp= 52 A2= 3/18 P=LVSIWAVPL
genpept PR=>gi|7023136|dbj|BAA91851.1| (AK001708) unnamed protein
product [Homo sapiens] POS=23 (SEQ ID NO: 127)

G=2881: Mg=1010.5: Tg=28+-1 #G+(8,5)
S=84(85,82) Mp=1010.5( 0.0) Tp= 20 A2=370/30 P=3SfMYGKWTV
genpept PR=>gi|1845267|gb|AAC51102.1| (U56402) SDFT5H [Homo sapiens]
POS=562 (SEQ ID NO: 128)

G=2891: Mg=1011.5: Tg=43+-1 #E+(9,5) #F+(1,1) #EST+(3,2) #FR+(6,3)
#G+(13,7) #K+(5,3) #L+(3,2)
S=79(78,82) Mp=1011.5( 0.0) Tp= 45 A2=>1k/31 P=LLLDVPTAAV
genpept PR=>gi|6165613|gb|AAF04616.1|AF097362_1 (AF097362)
gamma-interferon inducible lysosomal thiol reductase [Homo sapiens]
POS=26 (SEQ ID NO: 2) ref

G=2918: Mg=1014.4: Tg=48+-0 #D+(1,1) #E+(16,8) #F+(11,7)
#EST+(2,1) #FR+(3,2) #G+(19,10)
S=88(97,68) Mp=1014.6( 0.2) Tp= 46 A2=160/32 P=SLINVGLISV
genpept PR=>gi|12653413|gb|AAH00476.1|AAH00476 (BC000476) acidic
protein rich in leucines [Homo sapiens] POS=46 (SEQ ID NO: 129)

G=2928: Mg=1015.4: Tg=56+-0 #E+(26,8) #EST+(2,1) #FR+(5,3)
S=92(97,81) Mp=1015.5( 0.1) Tp= 61 A2=666/30 P=ALLGTLWEI
genpept PR=>gi|2224595|dbj|BAA20785.1| (AB002325) KIAA0327 protein
[Homo sapiens] POS=18 (SEQ ID NO: 130)

G=2929: Mg=1015.4: Tg=41+-1 #E+(5,3) #EST+(4,2) #FR+(2,2)
#G+(12,7) #K+(5,3)
S=81(86,72) Mp=1015.5( 0.1) Tp= 39 A2= 13/16 P=FQDPVPLTV
genpept PR=>gi|4325107|gb|AAD17258.1| (AF119042) transcriptional
intermediary factor 1 alpha; TIF1alpha [Homo sapiens] POS=890 (SEQ
ID NO:131)
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G=2947: Mg=1016.4: Tg=45+-1 #E+(3,3) #F+(8,5) #EST+(2,2) #FR+(3,1)
#G+(18,10) #K+(7,4)
S=82(95,54) Mp=1016.6( 0.2) Tp= 39 A2=512/28 P=GLYPNLIQV
genpept PR=>gi|4240269|dbj|BAA74913.1| (AB020697) KIAA0890 protein
[Homo sapiens] POS=1022 (SEQ ID NO: 132)

G=2965: Mg=1018.4: Tg=23+-4 #D+(3,1) #E+(2,2) #F+(2,2) #G+(25,8)
S=94(96,90) Mp=1018.6( 0.2) Tp= 19 A2= 79/26 P-VtoDSKIVQV
genpept PR=>gi|913393|gb|AAC60648.1| (S75295) nucleoprotein
interactor 1, NPI-1=SRP1 homolog [human, cervical carcinoma HeLa
cells, Peptide, 538 aa] [Homo sapiens] POS=434 (SEQ ID NO: 133)

G=2976: Mg=1019.6: Tg=46+-0 #D+(5,2) #E+(6,1) #F+(2,2) #EST+(1,1)
#FR+(2,1) #G+(4,3)
S=83(81,90) Mp=1019.6( 0.0) Tp= 40 A2=745/32 P=ALLDKLYAL
genpept PR=>gi|7023341|dbj|BAA91929.1| (AK001630) unnamed protein
product [Homo sapiens] POS=78 (SEQ ID NO: 134)

G=2985: Mg=1020.5: Tg=45+-0 #D+(5,3) #E+(3,2) #F+(4,3) #FR+(1,1)
#G+(7,5)
S=99(99,99) Mp=1020.5( 0.0) Tp= 40 A2=298/27 P=NLASFIEQV
genpept PR=>gi|348907|gb|AAA35672.1| (L15428) 4a-carbinolamine
dehydratase [Homo sapiens] POS=19 (SEQ ID NO:135)

G=2998: Mg=1022.4: Tg=44+-0 #0+(5,3)
S=76(70,90) Mp=1022.4( 0.0) Tp= 43 A2=0.7/12 P=TLWVDPYE
genpept PR=>gi|1703501|gb|AAB37580.1| (U72649) BTG2 [Homo sapiens]
POS=101 (SEQ ID NO: 136)

G=3002: Mg=1022.5: Tg=45+-1 #S+(2,1) #D+(3,2) #G+(7,4)
S=82(83,81) Mp=1022.5( 0.0) Tp= 42 A2=>1k/25 P=KIADFGWSV
genpept PR=>gi|3127068|gb|AAC77369.1| (AF059681) serine/threonine
kinase 13 [Homo sapiens] POS=147 (SEQ ID NO: 137)

G=3036: Mg=1025.5: Tg=37+-1 #S #D+(1,1) #F+(4,2) #EST+(1,1)
S=90(91,90) Mp=1025.6( 0.1) Tp= 36 A2= 89/26 P=SLLSHVEQL
genpept PR=>gi|5305429|gb|AAD41647.1|AF072933_1 (AF072933) Mad2-like
protein [Homo sapiens] POS=114 (SEQ ID NO: 138)






G=3041: Mg=1026.3: Tg=45+-0 #D+(7,3) #FR+(1,1) #G+(4,3)
S=84(90,72) Mp=1025.6(-0.7) Tp= 38 A2=>1k/30 P=GLADKVYFL
genpept PR=>gi|1226049|dbj|BAA11423.1| (D78586) multifunctional
protein CAD [Homo sapiens] POS=445 (SEQ ID NO: 139)



G=3061: Mg=1028.5: Tg=35+-1 #S+(6,2) #D+(3,1) #E+(20,7) #F+(8,5)
#EST+(5,2) #FR+(5,2) #G+(11,7)
S=88(92,81) Mp=1028.5( 0.0) Tp= 32 A2= 88/28 P=GLIEKNIEL
genpept PR=>gi|1632819|emb|CAA45219.1| (X63692) DNA
(cytosine-5-)-methyltransferase [Homo sapiens] POS=425 (SEQ ID
NO:13)

G=3073: Mg=1029.5: Tg=Sl+-0 #D+(1,1) #FR+(2,1) #G+(5,4) #K+(2,1)
S=81(78,90) Mp=1029.6( 0.1) Tp= 35 A2=>1k/31 P=SLLDIXEKV
genpept PR=>gi|1063586|gb|AAB41564.1| (L48546) tuberin [Homo
sapiens] POS=526 (SEQ ID NO: 140)
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G=3092: Mg=1031.4: Tg=61+-1 #S+(8,2) #D+(29,4) #E+(10,6) #F+(3,1)
#EST+(2,1) #FR+(5,2) #H+(1,1) #K+(1,1)
S=84(82,90) Mp=1031.6( 0.2) Tp= 64 A2=865/30 P=GLYPGLIWL
genpept PR=>gi|25^9395|gb|AAB84111.1| (AF027292) interferon
regulatory factor 6 [Homo sapiens] POS=21 (SEQ ID NO:14)

G=3118; Mg^l034,4: Tg=60-h-l #D+(16,5) #E+(45,9) #F+{14,7) 
#ESTt(2,l) (FR+(1Q,3> #G+tll,7) #K+P/4) 

S=81(85,72) Mp=1034.6( 0.2) Tp= 66 A2= 32/21 P=FVFPGELLL
genpept PR=>gi|12652633|gb|AAH00062.1|AAH00062 (BC000062) solute
carrier family 1 (neutral amino acid transporter), member 5 [Homo
sapiens] POS=89 (SEQ ID NO:141)

G=3127: Mg=1036.3: Tg=36+-0 #F+(2,1)
S=78(77,81) Mp=1036.6( 0.3) Tp= 35 A2=656/30 P=ALNELLQHV
genpept PR=>gi|6682361|gb|AAF23322.1|AF177198_1 (AF177198) talin
[Homo sapiens] POS=777 (SEQ ID NO:142) ref

G=3128: Mg=1036.3: Tg=36+-1 #G+(12,7)
S=83(64,61) Mp=1036.5( 0.2) Tp= 29 A2=913/27 P=NLYEGQITV
genpept PR=>gi|1699038|gb|AAC50967.1| (U78735) ABC3 [Homo sapiens]
POS=555 (SEQ ID NO:143)

G»3142: Mg-1037,5: Tg-43+-l #EST+{2,2) #FR+[5 f 2) #G+<1,15 
25 #I+(U,5> #J+(S,3> 

5-86(89,79) Mp-1037.5< 0.0) Tp* 41 A2-0.1/15 P=FTKDFA?VI 
qenpept PK->gi 17022324 1 clbj IBM91736.il (AK001518) unnamed protein 
product [Homo sapiens] POS-77 (SEQ ID NO: 144} 

30 G-3144: Mg-1037.6: Tg=51+-1 #D+(7,3) #E+(29,6) #F+(U,7) 

#EST+<2,D m+{4 p 2) #G+(12,7) #K+{3,2) #L+|1,1) -a-KT LEPVLLL 

$=87(86,90) Mp-1037>7( 0.1) Tp= 53 A2*>lk/31 P-KLkEPVLLL 
qmP ept PR=>gil3384«7|gblMA60583.1| (M60854) RPS16 [Homo sapiens] 
POS-50 (SEQ ID NO:145)ref 

35 G=3154; *g=1038.5: Tg=48 + -1 #D*<32,4> « + [48,» W+(7,3> 

#EST + (6,2) #FR + {9,3) #<*<24 10, « + <9, 6> p.^PAIVHI 

40 SSSa-Saip/Hi.) box polypeptide 17 (72kD)J [Kflino «pien,J 
POS=146 (SEQ ID NQ:15) ref 

G-3183: Mg-1041.4: Tg=52+-0 #FR+ (2, D #C* U.: |> p=GLFAPQFYV 
L-!d/R2 72) M»=1041.6< 0.2) Tp= 52 A2=>1W23 P-GLtAfUtiv 
fe^ept' AUgi I gb I MB65850 ■ X i (U70730, SnoN2 [Ho.o salens] 
POS=274 (SEQ ID NO: 146) 

Sct^r RWA activator isoform 2 (Homo sapiens] 
NO:147) 

G-3201: MOM043.5: Tg^58 + -1 #E + (2,2) #F + (6,4) #EST + (1,D #»+^> 
55 #G+(1,4) #K+ (1/D 
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S^85(88,81} Mp=1043,7( 0.2) Tp= 62 A2^408/27 P-FLLPILSQI 
gpngept PR=>gi I 2330552 j gblAAC51830, 1 1 (AF000983) dead box, X isoform 
[Homo sapiens] POS=234 (SEQ ID NO; 148) 

5 G=3213; Mg=1045,5: rg=56+-0 #E+(1 # 1) #F+{2,2) #FR+(1,1) 

S=84{90,72) Mp=1044.5(-1.0) Tp= 54 A2°0,3/18 P=FLIPLNITN 
genpept PR=»gi ! 2224611 |dbj 1 BAA207 93 . 1 j (AB002333) KIAA0335 (Homo 
sapiens] POS=938 (SEQ ID NO: 149) 

10 G=3219: Mg=1046.6; Tg=40+-1 #D+<2,2) #E+(1,1> #F+(2,1) #EST+{4,1) 
#FR+(5,2) #G+CL,1) #K+(3,3) 

5=85(83,90} Mp<*1046.7( 0,1) Tp=> 39 A2*=243/30 P-NLLPKLHIV 
genpept PR^>gi I 4588524 fgfo |MD26136. 1 |AF109196_1 (AF109196) 

intracellular chloride channel p64Hl [Homo sapiens! PQS-190 (SEQ ID 
15 NO:150) 

G^3227; Mg=1047.6: Tg=44+-G #D+(1,1) #£+(15,8) #F+(2,2) #EST+{5,2) 
#FR+(7,3) #G+(12,6) 

S=79 (82,72) Mp=1047.6{ 0.0) Tp- 50 A2=413/31 P<,LDRFLATV 
20 gen pept PR=>gi 1 12653303 I gb | AAHQ0420 . 1 [ AAHOOfl 20 (BC000420) cycUft I 
[Romo sapiens] POS-72 (SEQ ID NO: 151) 



5=77(79,74) M P =1049.5( 0.1] Tp= 36 A2=294/29 P-YLDPSVLSGV 

25 genpept PR=>gi ! 505098 t dbj ] BAA0 6683 . 1 1 (D31385) KIAA0069 [Homo 
sapiens) POS=84 (SEQ ID NO; 152) 

0^3242: Mg*=1049 . 5 ; Tg=>44+-0 #F+(3,6) 

S=78 (85, 63) Mp=1048.5(-1.0) Tp= 44 A2°378/27 P-LLYPTEITV 
30 genpept PR=>gi{220141 1 cibj IBAA00845.H (D0IO38) VLA-3 alpha subunit 
[Homo sapiens] POS=-798 (SEQ ID NO: 153) 
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£3-3257; Mg*l051 t 4: Tg=65+-1 #D+(2,2) #E+(9,4) #F+(9,4) #£ST+(2,1) 
#FR+(6,3) 

S=88{88,90) Mp-105l.6( 0.2) Tp= 63 A2=>lk/26 F-NLGDFLIFL 
genpept PR=>gi 114 69175 jdbj 1BAA09475.11 (D50916) The KIAA0126 gene is 
partially related to a yeast gene, [Homo sapiens! POS=638 mo ID 
NO: 154) 

(3=3258: Mg-1051.4: Tg=54+-0 #D+(lB r 4> #E+(10,6) #F+(8,5) #G+U,D 

lSi{8s!66J Mp=1051.S( 0.1) Tp= 56 A2=>U/30 P^YSGLTWL 
genpept PR=>gi 1 178989 1 gb | AAA9Q928 . 1 i (M57763) ADP-ribosylat*on 
factor [Homo sapiens} POS=161 (SSQ ID NO: 155) 

G»3270: Mg-1054.3: Tg=51+-0 #D+(5,3) #£+(19,8) #F+<12,7) 

2S;i 2 «:ll, ln, *^54. S ( 0.2, ^ <S P=SLFDLNFQA 
qenpept PR=>gi 1 189292 I gb I AAB60701 . 1 1 (M81600) NAD (P) fl :quinone 
oxireductase [Homo sapiens] POS=227 (Seq ID NO-. 156) 



G=3271: M<?=1054.3: Tg=55+-1 #K+(S,2} 

S=80(77,30) Mp=1054.4( 0.1) Tp= 43 A2=0.0/S P-MFSLEDSII 

qenP ept PR=>gj. 1 809029 1 emb I CAA57S93.il (X82G76) tyrosine phosphatase 
55 (Homo sapiens] POS=833 (SEQ ID NO:157) 

G=3279: Mg=105S.4: Tg=37+-1 #G+(6,4) 



G=3240: Mg°1049.4: Tg=41+-1 #E+(3,2) #F+(2,2) #EST+{2,2) #FR+(3,3) I 



I 
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5=76(74,81) Mp=lG55.3{-0.1> Tp= 37 A2-122/19 P-AMWEHPITA 
genpept PR=>gi i 10197638 } gb IAAG14 955 . I [AF1824I9J, (AF182419) MDS01S 
[Homo sapiens] POS=65 (SEQ ID NO; 158} 

G=3297: Mg=1057.5: Tg=17+-2 #G+(8,4)

S=95(94,99) Mp=1057.6( 0.1) Tp= 31 A2=320/26 P=YLGRLAHEV
genpept PR=>gi|12653485|gb|AAH00514.1|AAH00514 (BC000514) ribosomal
protein L13a [Homo sapiens] POS=137 (SEQ ID NO:159)

G=3309: Mg=1059.5: Tg=34+-0 #F+(3,3) #EST+(3,1) #FR+(2,1)

S=82(84,79) Mp=1059.6( 0.1) Tp= 30 A2=482/24 P=GLIDHQTYL
genpept PR=>gi|1477651|gb|AAB05428.1| (U63610) plectin [Homo
sapiens] POS=U88 (SEQ ID NO:160)

G=3325: Mg=1061.4: Tg=40+-1 #G+(6,3)

S=85(87,81) Mp=1061.7( 0.3) Tp= 31 A2=523/26 P=AIQDKLFQV
genpept PR=>gi|13543970|gb|AAH06123.1|AAH06123 (BC006123) Similar to
RIKEN cDNA 0710001P09 gene [Homo sapiens] POS=96 (SEQ ID NO:161)

G=3329: Mg=1062.4: Tg=29+-0 #H+(6,3)

S=89(89,89) Mp=1062.5( 0.1) Tp= 27 A2=0.0/9 P=IVKWDRDM
genpept PR=>gi|179318|gb|AAA51311.1| (M17987) beta-2-microglobulin
[Homo sapiens] POS=112 (SEQ ID NO:162)

G=3331: Mg=1062.5: Tg=33+-1 #F+(9,6) #EST+(1,1) #G+(18,10)
#K+(4,3)

S=86(97,63) Mp=1062.6( 0.1) Tp= 30 A2= 6/20 P=RIIDVVYNA
genpept PR=>gi|36150|emb|CAA47670.1| (X67247) ribosomal protein S8
[Homo sapiens] POS=77 (SEQ ID NO:163)



30 
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G=3342: Mg=1064.4: Tg°20+-4 #E+(lil) #£"+(6/ 4} #EST+{3,1) #G+[16,6) 
S«86(88,82) Mp-1064.6( 0.2) Tp= 19 A2-439/26 P=KIY£GGVEV 
genpept PR=>gi I 550013 1 gb I AAA85654 . 1 1 (U14966) ribosomal protean L5 
[Homo sapiens] POS*117 (SEQ ID NO: 164) 



G-3364: Mg=1066,3: Tg=50+-0 #£+(3,2) #G+(i8,10) *K+(9 F 5) #L+U,U 
S=81{85,72] Mp=1066.6{ 0.3) Tp= 50 A2-736/26 P-FLPSVIIDV 
genpept PR->gill0455-?4lgbi»AC50293-i| (U37012) cleavage and 
polyadenylation specificity factor [Homo sapiens] POS=l85 {SEQ ZD 
40 NO: 165) 

G=3384: Mg=1068.4f Tg=29+-2 #$+(8 f 2) #D+(3,1) #£+(3.7,4) #F+(8,5) 
#EST+(5,2) #FR+{6,3) #G+(18,7] #K+(4,2) 

S=67(90,81) Mp=1068.6( 0.2) Tp= 29 A2=462/23 P=ALSDHHIYL
genpept PR=>gi|28597|emb|CAA28861.1| (X05236) aldolase A (AA 1-364)
[Homo sapiens] POS=216 (SEQ ID NO:16) ref

G-3385: Mg=1069.3: Tg=37+-1 #F+(2,2) lG+<7,5) 

S=94{92,6S1 Mp=1069.3( 0.0) Tp= 25 A2-355/25 P-YMMPVNSEV 
50 genpept ?»P>gi 1 12667401! gb|AAK01426.1 1 AF32673X_1 (AF326731) NUF2R 
[Homo sapiens] (SEQ ID WO: 166) 

G=3406: Mg=1071.5: Tg=20+-3 #D+(1,1) #FR+(2,1) #G+(20,7)
S=94(97,90) Mp=1071.6( 0.1) Tp= 24 A2=XG9/3G P=ILDQKINEV
genpept PR=>gi|338278|gb|AAA60563.1| (M31061) ornithine
decarboxylase [Homo sapiens] POS=23 (SEQ ID NO:17) ref
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G=3410: Mg-1071.6: Tg=8+-8 #D+(7,1) #F+(6,1) #FR+(6,1) #G+(25,5) 
#K+(4,3) 

S=94(96,90) Mp=1071.7( 0.1) Tp= 12 A2= 53/29 P=ILDKKVEKV
genpept PR=>gi|386786|gb|AAA36026.1| (J04988) 90 kD heat shock
protein [Homo sapiens] POS=570 (SEQ ID NO:18) ref

G=3418: Mg=l073.6: Tg=5+-7 #F+{1,1> #6+(3,3> 

3-79(79,81) Mp«1072.5(-l.l) Tp= 5 A2-0.1/14 P=NKDLKMPKV 
qsnpept PR->gi|1808578|dfoj IBAA079l8.il (D44466) proteasome subunit 
10 pll2 [Homo sapiens] POS°792 (SEQ ID MO; 167) 

G=3424: Mg=l074.6; Tq°46+-0 #K+(6,4) 

$=90(90,90) Mp-1074.4C-0.2) Tp° 31 A2=20l/29 P-NLAEDIMRL 
genpept PR->gi I 37852 | emb I CAA79613 . X I (Z19554) vimentin [Homo 
15 sapiens] P0S=177 (SEQ ID NO: 168) 

G=3427; Mg-1075.4: Tg=46+-0 #D+(1,1) #F+(8,6) 

S-72(73,72) Mp=1075.6( 0.2) Tp= 44 A2=>lk/31 P=YLP£LLQTV 
qenpept ?R=>gi 1 12653299 I gb | AAHQ0418 . 1 1 AAH00418 (BC000418) 

20 ectodermal-neural cortex (with BTB-like domain) [Homo sapiens] 
POS=22$ (SEQ ID WO: 169) 

G=3470; Mg=l080.4: Tg=»62+-1 #D+{13,4) #E+(17,8) #F+(U,6) 
#EST+{2,1) #FR+{6,3) #G+(19,9) #K+(10,5) #L+(1,1) 
25 3=82(82,82) Mp-1080,6( 0.2) Tp« 69 A2=>lk/27 P-FLYPFPLAL 
genpept PR~>gi I 436224 ) dbj I BAA05062 . 1 1 (D26067) KIAA0033 [Homo 
sapiens] POS=l85 (SEQ ID NO: 170} 

G=3472: Mg=1080.4: Tg=50+-0 #F+(13,7) #G+(25,11) #K+(10,5)
S=76(87,52) Mp=1080.5( 0.1) Tp= 53 A2=182/33 P=SLLPPTALVGL
genpept PR=>gi|1296664|emb|CAA65774.1| (X97064) Sec23 protein [Homo
sapiens] POS=156 (SEQ ID NO:19)

G=3476: Mg=10B0.7: Tg=41+-1 #FR+(1,1) #G+(5,4) 
35 3=75(77,72) Mp=1080 . 6 (-0 . 1 ) Tp= 38 A2=>lk/29 P=NLYPrVKTV 
genpept FR=>gi 1 1263196 I gb i AAA97405 . 1 1 (U37436) AICAR 

formyltransf©rase/IMP cyclohydrolsse bifunctional enzyme [Homo 
sapiens] POS=101 (SEQ ID NO: 171) 

40 G=3477: Mg=1081.4: Tg=56+-0 #F+<6,3) 

S=90(87,99) Hp-1081.7( 0.3) Tp- 57 A2=>lk/24 P-SVIEQLFFV 

qenpept PR=>gi I 30140 i emb I CAA34277 ♦ 1 1 (X16155) COUP-TF [Homo sapiens] 
POS=378 (SEQ ID NO: 172) 

45 G=3478: Mg-1081.4: Tg-56+-0 #G+(4,3) 

S=84 (86,81) Mp=l080*6<-0.8) Tp= 57 A2=>lk/29 P-SLLEPFVYL 
genpept PR=>gi I 7008404 | gb I AAF34999 . 1 1 (AF229840) kappa B-ras 2 [Homo 
sapiens] POS°156 (SEQ ID NO: 173) 

50 G=3497: Mg=1084.7: Tg=24+-3 #F+(1,D #EST+(3,1) #FR+{2,1) 

$=79(82, 72) Mp-=1084 .6(-0.1) Tp- 37 A2o437/26 P=ILFGHENRV 
qenpeyt PR=>gi I 591194 X I smb \ CAB55946 . 1 1 (AL117471 ) hypothetical 
protein [Homo sapiens] POS=250 (SEQ ID NO: 174) ref 

55 G=3505; Mg=1096.5; Tg=19+-3 #G+[18,6) 
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S=88(91,83) Mp=1086.7( 0.2) Tp= 20 A2«998/29 P-KLQEVGQVSV 
genpept PR=>gi | 340307 [ gb i AAA368Q8 , 1 1 (M14648) vitronectin alpha 
subunit precursor [Homo sapiens] POS=338 (SEQ ID NO: 175) 

5 G=3520: Mg-1088 , 6: Tg«37+-1 #F+(3,2) ttEST+{2,l) #G+(8,5) #HC+ (5,4) 
5=72(82,49} Mp-1088 . 5 (-0 . 1 ) Tp= 37 A2= 75/16 P°RLFDEPQLA 
genpept PR=>gi | 3334982 | gb | AAC2 6984 . 1 | AAC26984 (AC005306) R27216J. 
[Homo sapiens] POS-2 (SEQ ID NO: 176) 

10 G-3521: Mg=l088.6: Tg=4 4+-1 #Q+ < 5 r ^ ) 

S=70(71,58) Mp=1088.6( 0.0) Tp= 43 A2=119/30 P=SLFPGKLEVV
genpept PR=>gi|440177|gb|AAC03568.1| (U01184) flightless-I homolog
[Homo sapiens] POS=1009 (SEQ ID NO:177)

15 G=3526: Mg=1089.6: Tg=50+-1 #S+(10 f l) 

3=84(91,68) Mp=1089.6( 0,0) Tp= 57 A2« 37/23 P°VMLGTPFLvr 
genpept PR=>gi I 458 9536 I dbD I BAA7 67 90 - 1 1 (AB023163) KIAAQ946 protein 
[Homo sapiens] POS^340 (SEQ ID NO; 178) 

G=3533: Mg=1091.4: Tg=15+-2 #G+(12,4)

S=86(95,68) Mp=1091.4( 0.0) Tp= 13 A2= 80/20 P=GVYDGEEHSV

genpept PR=>gi|4102749|gb|AAD01565.1| (AF015766) MAGE XP-2 protein
[Homo sapiens] POS=231 (SEQ ID NO:20)

G=3545: Mg=1094.4: Tg=50+-1 #E+(22,9) #F+(16,7) #EST+(4,2)
#FR+(4,2) #G+(50,11) #K+(11,6) #L+(2,2)

S=80(89,59) Mp=1094.5( 0.1) Tp= 49 A2=182/33 P=SLLPPDALVGL
genpept PR=>gi|13529299|gb|AAH05404.1|AAH05404 (BC005404) Unknown
(protein for MGC:5020) [Homo sapiens] POS=156 (SEQ ID NO:21)

30 

G=3563: Mg=1098.3: Tg=38+-1 #D+(7,3) #£+(10,6) ttF+(5,4) #EST+(3,2) 
#FR+(4,2) #G+(12,8) 

S=38(88i90) Mp°1098.4( 0.1) Tp= 30 A2=28Q/26 P-SLYDYNPNL 
genpept PR=>gi1 3337383 I gb|AAC27 4 26.1 1 (AC0Q2544) Translation 
35 initiation factor elF-pllO [Homo sapiens] POS°381 (SEQ ID NO: 179) 

G=3566: Mg=1098.6: Tg=50-t--0 #E+(6,3) #EST+(3,U #FR+(2,I) 

S=92 Mp-1098,7< 0.1) Tp= S2 A2=194/25 P=FLLGPPLVLA 

genpept PR=>gi I 887368 I gbl AAC42003 . 1 1 (L40397) ORF; putative [Homo 
40 sapiens] POS=31 (SEQ ID NO: 180) 

G=3579: Mg=1101.4: Tg=34+-1 #F+(5,4) #G+(2,2) 

5=88(91,93) Mp=U01,4( 0.0) Tp= 35 A2-502/24 P=FLYTGEGDTV 
genpept FR=>gi [ 1184320 I gb I AAC50373 . 1 j (045880) X-linked inhibitor of 
45 apotosis protein [Homo sapiens] POS=52 (SEQ ID NO: 181) 

G=3568: Mq=1102.6: Tg=34+-1 #E+U,1) #ES*+{4,2) #FR+{2,2) 
5^82(83,81) Mp=U02.6( 0.0) Tp= 27 A2=>lk/26 P=KLNPQQFEV 
genpept PR=>gi [ 624704 I gb I AAB05994 . 1 1 (L38 961) putative transmembrane 
50 protein precursor [Homo sapiens] POS=289 (SEQ ID NO: 182) 

G=3596: Mg=ll03.4: Tg=28+-2 #F+(3,2) #G+(1,1) 

S~88{94,74) Mp=1103.4{ 0.0} Tp= 23 A2=140/27 P=SLADLQNDEV 
genpept PR->gi | 854179 1 emb I CAA60827 . 1 | (X87373) nbosomal protein S3a 
55 [Homo sapiens] POS=70 (SEQ ID NO: 183) 

G=3603: Mg-1104.7: Tg=4$+-0 #D+{1,1) #F+{4,3) #EST+(3,1) #FR+{5,2) 
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5=85(99,54) Mp°H04.7( 0,0) Tp° 40 A2°364/28 P-RLLDYWNI 
cjenpept PR=>gi I 702376B I dbj I BAA92081 . 1 1 (AKO02094) unnamed protein 
product [Homo sapiens] POS°172 (SEQ ID NO: 184) 

5 G^3629; Mg=1113.5: Tg°35+-1 #G+(10,6) #K+(10,5) 

S=77{76,81) Mp-lll3.6( 0,1) Tp= 31 A2- 46/21 P^FVDDYWRV 
genpept FR°>gi | 1923256 | gb IAAC51866 . 1 I (066762) 26S 

proteasome-associated padl homolog [Homo sapiens] POS=61 {SEQ ID 
NO:X85) 

10 

G=3637: Mg=1115.4: Tg=55+-1 #E+(29,8) #F-t-{14,7) #EST+(2,1) 
#FR+(4,2) #Gt{9,5) #L+(1,1) 

S=B2{90,66) Mp=lll5.5( 0.1) Tp° 61 A2=>lk/29 P=SLFEGTWL 
genpspt PR=>gi | 12653065 | gb I AAH00297 . 1 | AAH00297 (BC000297 ) 

15 3-hydroxy-3-methylglutaryl-Coenzyme A synthase 1 (soluble) [Homo 
sapiens] PQS°447 (SEQ ID NO: 186} 

G=3652; Mg=1119.5: Tg=56+-0 #D+(4,3} 

2=80(84,72) Mp=1119.7( 0.2) Tp= 57 A2=5I2/27 P=ALYNWLIQV 
20 genpept PR°>gi | 3288447 i embl CAA07553 . 1 1 (AJ007558) nucleoporin 155 
[Homo sapiens] POS=1038 (S£Q ID NO: 187) 

G=3653: Mg=1119.6: Tg=30+-1 #D+<1,1) #F+(3,2) #ES?+{1,1) #FR+{2,1) 
#G+[10,S) 

25 S=60 (84, 72) Mp=lll9.7( 0.1) Tp= 30 A2= 97/25 P=>VLIDYQROT 
genpept PR->gi | 2626840 | dbj | BAA23415 , 1 1 (D89729) CRMl protein (Homo 
sapiens] POS=784 (SEQ ID NO: 188) 

G=3658: Mg=1121.3: Tg=49+-0 #S+(9,2) #D+(7,3) #E+(9,5) #F+(14,7)
#G+(26,11) #K+(2,2) #L+(1,1)

S=84(81,91) Mp=1121.5( 0.2) Tp= 47 A2=577/24 P=TLWVDPYEV
genpept PR=>gi|1703501|gb|AAB37580.1| (U72649) BTG2 [Homo sapiens]
POS=101 (SEQ ID NO:22) ref

35 G=3683: Mg=1128>3: 1g-51+-0 #S #G+(17,9) #K+(2,1) 

S=86{88,82) Mp=1128,5( 0.2) Tp= 55 A2^348/25 F=FTWEGLYNV 
genpept PR=>gi ! 1276912 | gb | AAC50450 . 1 1 (044839) UHX1 protein [Homo 
sapiens] PfJfi=353 (SEQ ID NO: 189) 

G=3694: Mg=1133.6: Tg=25+-3 #D+(2,1) #F+(1,1) #G+(7,3)

S=85(87,81) Mp=1133.7( 0.1) Tp= 30 A2=>1k/32 P=ILMEHIHKL
genpept PR=>gi|298486|gb|AAB25672.1| (S56985) ribosomal protein L19
[human, breast cancer cell line, MCF-7, Peptide, 196 aa] [Homo
sapiens] POS=137 (SEQ ID NO:190) ref

45 

G=3597: Mg=1134.6: Tg=42+-1 #£+{12, 6) #F*{1,1) #EST+(3 f l) 

#FR+(3 f l) #G+(9,S) #K+(13,6) #L+(1,1) 

S=81(93,53) Mp=1134.6{ 0,0) Tp= 37 A2*=193/26 P-RLDELGGVYL 
genpept PR->gi f 13374901 1 emb | CAC3 4517 . 1 \ (AL031659) dJ343K2<2.3 
50 (ribophorin II (iaofom 3)) [Homo sapiens] FOS=485 (SEQ ID NO: 191) 

G=3711: Mg=1140.6: Tg=40+-1 #EST+{1,1) #FR+(1,1) #G+(l f l) 
S=89(93,82) Mp=1140.7( 0,1) Tp= 40 A2=526/27 P=KLLSKFY£t 
genpept PR=>gi 1 10439903 | dbj I BAB15591 . 1 1 (AK02693Q) unnamed protein 
55 product [Homo sapiens] POS^231 (SEQ ID NO: 192) 

G=3721: Mg=1145.4: Tg=49+-1 #S+(7,1) #F+(2,2) #G+(14,10)
S=79(83,7G) Mp=1145.5( 0.1) Tp= 50 A2=>1k/23 P=FLFDGSPTYV
genpept PR=>gi|1049053|gb|AAC50259.1| (U26644) encodes region of
fatty acid synthase activity; FAS; multifunctional protein [Homo
sapiens] POS=2329 (SEQ ID NO:23)

5 

G=3729: Mg°1147.5: Tg=48+-1 #E+(3,2) #EST+(2,1) #FR+{5,2) #G+(4,3) 
#K+(13,6) 

S°91(92,90) Mp=1147,7( 0.2) Tp° 45 A2^>lk/20 P=KVLDFEHFL 
qenpept PR=>gi 1 189022 ( gb I AAA36348 . 1 1 (M22920) smooth muscle mysoin 
10 light chain [Homo sapiens] POS=28 (SEQ ID NO: 193) 

G«3743; Mg=1152.6: Tg=47+-0 #D+(5,3) #f+(l,l) 

3=79(82,72) Mp=1151. 6 (-1. 0) Tp= 43 A2=>lk/24 P=YLPEDFIRV 
qenpept PR=>gi j 2653877 | gt>| AAB87669. 1 j (AF026273) interleukin-1 
15 receptor-associated kinase-2; IRAK- 2 [Homo sapiens] POS=381 (SEQ ID 
HO: 194) 

G=3754: Mg=1156.5: Tg=35+-1 #G+{3,2) #K-c{8 f 5) 

$=91(95,83) Mp=1156,5( 0.0) Tp= 43 A2-403/28 P=FLSEHPNVTL 
20 qenpept PR°>gi | 5102831 1 emb I CAB45270 . 1 1 (AL022318) bK150C2,2 
(Phorbolin 3) [Homo sapiens] POS=107 (SEQ ID NO: 195) 

G=3806: Mg=1210.4: Tg=42+-1 #E+(7,4) #EST+(1,1) #FR+(5,3)
#G+(20,U) #K+(10,6) #L+(4,2)
S=76(80,68) Mp=1210.6( 0.2) Tp= 44 A2=128/21 P=LLLDVPTAAVQA
genpept PR=>gi|6165618|gb|AAF04618.1|AF097362_1 (AF097362)
gamma-interferon inducible lysosomal thiol reductase [Homo sapiens]
POS=26 (SEQ ID NO:3) ref

G=3831: Mg=1258.5: Tg=54+-1 #S+(12,2) #E+(12,6) #F+(12,6)
#EST+(1,1) #FR+(7,3) #G+(20,10) #H+(1,1) #K+(10,5)

S=87(96,68) Mp=1258.6( 0.1) Tp= 58 A2=611/27 P=FLFDGSPTYVL
genpept PR=>gi|1049053|gb|AAC50259.1| (U26644) encodes region of
fatty acid synthase activity; FAS; multifunctional protein [Homo
sapiens] POS=2329 (SEQ ID NO:24)

G=3859: Mg=1360.4: Tg=44+-1 #E+(3,2) #G+(19,10)

S=91(99,75) Mp=1360.6( 0.2) Tp= 42 A2=>1k/28 P=ALWDIETGQQTV
genpept PR=>gi|306785|gb|AAA35922.1| (M16538) G protein beta subunit
[Homo sapiens] POS=167 (SEQ ID NO:25)
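
The rows of the table above follow a fixed layout, so a cleaned row can be split into its fields mechanically. The following is only a minimal sketch: the pattern `ROW` and the helper `parse_row` are hypothetical names introduced here for illustration, the field names (S, Mp, Tp, A2, P) are copied from the table without interpreting them, and the sample row is the cleaned form of the SEQ ID NO:25 entry.

```python
import re

# Hypothetical pattern for one cleaned "S= ... P=" row; field names are copied
# from the table and their meanings are not interpreted here.
ROW = re.compile(
    r"S=\s*(?P<S>\d+)\((?P<S1>\d+),(?P<S2>\d+)\)\s+"
    r"Mp=\s*(?P<Mp>\d+\.\d+)\(\s*(?P<dM>-?\d+\.\d+)\)\s+"
    r"Tp=\s*(?P<Tp>\d+)\s+"
    r"A2=\s*(?P<A2>\S+)\s+"
    r"P=(?P<P>[A-Z]+)"
)


def parse_row(line):
    """Return the row's fields as a dict, or None if the row does not match."""
    m = ROW.search(line)
    if m is None:
        return None
    row = m.groupdict()
    for key in ("S", "S1", "S2", "Tp"):
        row[key] = int(row[key])
    for key in ("Mp", "dM"):
        row[key] = float(row[key])
    return row


row = parse_row("S=91(99,75) Mp=1360.6( 0.2) Tp= 42 A2=>1k/28 P=ALWDIETGQQTV")
print(row["P"], row["Mp"], row["dM"], len(row["P"]))
```

Rows that are still garbled simply fail to match and return `None`, which makes it easy to list the entries that need manual correction.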



DISCUSSION 

Among the thousands of different peptides presented within the context
of the MHC class-I on cancer cells, only a few may eventually become
candidates for the development of anti-cancer vaccines. The identification of
such cancer specific peptides depends on sequencing a relatively large number
of peptides.

While reducing the present invention to practice, a novel method was
developed to identify candidate peptides for the development of anti-cancer
vaccines. The novel method involves expressing the soluble extra-cellular
domain of the MHC molecules, which is simple to purify, and recovering
from it large amounts of MHC bound peptides ready for identification by
ESI-MS/MS.

Purification of the extra-cellular domain of MHC was previously
achieved by truncating its entire transmembrane and cytoplasmic domains
[30], by using a non-functional transmembrane domain such as Q10b [24] or
by fusing the extra-cellular domains to soluble secreted proteins such as
antibody Fc domains [31, 32]. Such sMHC molecules were produced in
cultured murine [33], human [30, 34] or insect [35] cells and in bacteria
[36]. The soluble MHC molecules expressed by the murine or the human cells
were capable of binding to their cognate TCRs, indicating the presence of
bound authentic peptides that mediate this interaction [33, 37]. Bound
peptides recovered from the secreted murine MHC H-2Ld were analyzed by
Edman sequencing [38]. More recently, peptides recovered from the murine
Q2/Q10 D, which is a natural mutation resulting in the formation of soluble and
secreted MHC molecules, were analyzed by ESI-MS/MS [39]. The results,
however, were very disappointing as only six peptides were recovered from 50
liters of culture medium [39].




While cultured cancer cell lines are an invaluable model for cancer research,
only a limited number of good model lines are available for the study of tumor
immunology, since some of the better model cell lines have rare MHC
haplotypes or down-regulated MHC expression altogether. The introduction of
foreign MHC into such cells in accordance with the teachings of the present
invention facilitates the use of the desired model cell lines in the search for
cancer specific MHC bound peptides. The recovery of secreted MHC from the
growth medium helps to sidestep possible interference by the cell's
background MHC haplotypes.

The number of peptides identifiable during each ESI-MS/MS run
performed in accordance with the present invention was limited by the rate at
which the mass spectrometer can switch between measuring the full spectrum
and performing CID, which was about four seconds. Therefore, during a
chromatography run of ninety minutes, around a thousand different peptides
could be mass measured and fragmented. The elution order of most of the
peptides recovered from MHC of a particular type and resolved in different
chromatography runs was similar. Therefore, their masses and CID data were
combined in order to improve their signal-to-noise ratio.
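
The throughput estimate above is simple arithmetic and can be checked directly. The sketch below assumes that one full-spectrum-plus-CID cycle takes the four seconds quoted and that the cycles run back to back for the whole ninety minutes; the function name is hypothetical. Under those assumptions the ceiling is 1350 cycles per run, which is consistent with the figure of around a thousand peptides once cycles that do not yield a distinct peptide are allowed for (the text does not state how many such cycles there were).

```python
def fragmentation_slots(run_minutes, cycle_seconds):
    """Upper bound on full-scan + CID cycles that fit in one chromatography run."""
    return int(run_minutes * 60 // cycle_seconds)


# Figures quoted in the text: a ninety-minute run and about four seconds per cycle.
print(fragmentation_slots(90, 4))  # 1350
```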

About one thousand different molecules that are certainly peptides have
been fragmented at least twice in all the different chromatography runs and out
of these about two hundred different peptides have been identified at high
certainty. Most of these peptides were derived from housekeeping proteins
and only a few were derived from proteins related to cancer. To increase the
likelihood of identifying more new cancer specific peptides, the total number
of identified peptides should be further enlarged. Identification of a large
number of peptides is currently limited by the availability of sufficient
amounts of peptides, by the capabilities of the mass spectrometers and by the
incompleteness of the databanks. With the expected near availability of the
entire human genome sequence, it is expected that more of the peptides will be
identifiable, excluding mutant peptides that will still need to be sequenced de
novo.
The soluble and secreted MHC molecules described here present
similar patterns of peptides as do the original cell surface MHC. This
conclusion emanates from the observation that most of these peptides possess
an amino acid sequence that fits the known sequence consensus of HLA-A2.1
and of B7 (see score columns in Table 8 above). Some of the peptides have
been identified previously as MHC bound peptides and thus indicate the
validity of the methodology of the present invention. The most significant
advantage of the use of secreted MHC as a source of peptides for analysis is
the order of magnitude larger recovery of sMHC molecules, and therefore of
peptides, per cell compared with the alternative purification from detergent
solubilized cells. In addition, the purified sMHC molecules were free of
interfering cellular debris and detergents.

Direct biochemical analysis of peptides eluted from MHC molecules
that are recovered from cancer cells allows unbiased identification of those
peptides that are actually presented by the MHC. By contrast, identifying
putative MHC bound peptides using computer programs based on the
consensus motifs, followed by synthesizing them and testing their
immunogenicity, bypasses the reliance on expensive and technically
demanding mass spectrometry needed for biochemical analysis of MHC bound
peptides. However, the motif prediction approach is dependent on the
availability of a well-established consensus for the MHC allele of interest and
is hampered by the difficulty of taking into account the processing machinery
involved in generating the peptides and transporting them to the MHC [13].
Moreover, it was suggested that contaminating protecting groups inadvertently
left on the synthetic peptides are very immunogenic and may become the target
for the activity of the CTLs. The CTLs generated in vitro are often low
affinity binders and incapable of recognizing the rare peptides actually
presented by the cancer cells in vivo [10].

The examples of identified peptides listed in Table 8 above include
peptides that do not fit the accepted consensus of MHC bound peptides
presented by the studied MHC haplotypes. Peptides longer than ten amino
acids are not expected to be common among MHC class-I peptides [40, 41].
However, in this study, a few peptides of 11 amino acids (p1210, SEQ ID NO:3
and p1258, SEQ ID NO:24) and 12 amino acids (p1360, SEQ ID NO:25) in length
were observed among the identified peptides. The computer programs for
motif predictions of class-I peptides are not able to predict such peptides as
their length is outside the consensus [42, 43]. The detection of longer peptides
among the peptides in the natural mixture suggests that the consensus motif
should possibly be extended to include such outliers. Another interesting
observation is the relative abundance of peptides that originated from
overlapping parts of the proteins with one or two amino acids difference in
length, such as p800 (SEQ ID NO:4) and p913 (SEQ ID NO:5) from β-catenin,
p1145 (SEQ ID NO:23) and p1258 (SEQ ID NO:24) from fatty acid synthase,
p898 (SEQ ID NO:1), p1011 (SEQ ID NO:2) and p1210 (SEQ ID NO:3) from
IP30 (Table 8). Moreover, peptides p1080 (SEQ ID NO:19) and p1094 (SEQ
ID NO:21) are derived from a homologous site in two alleles of the same
protein. This observation points to the existence of structural hotspots for
generation of peptides, possibly as a result of heat-shock protein binding that
protects these regions from complete proteolysis. Differences in length
could also result from incomplete trimming of the peptides in the endoplasmic
reticulum [44, 45].
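
The two relationships described above, length variants of one site and single-residue variants of a homologous site, can be checked mechanically on the sequences listed in Table 8. The sketch below is illustrative only: the helper names are hypothetical, and the four sequences are those given in the table for SEQ ID NOs 23, 24, 19 and 21.

```python
def is_extension(short, long):
    """True if `long` contains `short` and is one or two residues longer."""
    return short in long and 0 < len(long) - len(short) <= 2


def mismatches(a, b):
    """Number of differing positions between two equal-length peptides."""
    if len(a) != len(b):
        raise ValueError("peptides must have the same length")
    return sum(x != y for x, y in zip(a, b))


p1145, p1258 = "FLFDGSPTYV", "FLFDGSPTYVL"   # fatty acid synthase, POS=2329
p1080, p1094 = "SLLPPTALVGL", "SLLPPDALVGL"  # both listed at POS=156

print(is_extension(p1145, p1258))  # True
print(mismatches(p1080, p1094))    # 1
```

Applied to a full peptide list, the same two checks would flag candidate hotspots as groups of peptides that share a start position and differ only in length or in a single residue.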

Interesting observations are the large similarities between the patterns
of peptides produced by cell lines of different tissue origin and, on the other
hand, the presence of a few peptides that are unique to one type of cancer cells.
The ability to characterize the similarities and differences between peptide
patterns of different cell lines and growth conditions and between different
HLA haplotypes is among the most important possible uses of the novel
methodology presented herein.

The most effective means to ascertain the identity of the amino acid
sequences of peptides that were identified by this method is to compare their
retention times, their exact masses and their CID data to those of the
corresponding synthetic peptides [16, 39, 46, 47]. The sequences of all the
peptides that were identified at high confidence by searching the databank with
their mass spectrometry data were shown to be correct when these parameters
were compared with the corresponding synthetic peptides.
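
The exact-mass part of this comparison can be illustrated with a short calculation. The sketch below rests on one assumption that is not stated in the text: that the peptide labels (p947, p1091) and the Mp column of Table 8 refer to the monoisotopic mass of the singly protonated peptide. The residue masses are standard monoisotopic values; the function name is hypothetical.

```python
# Standard monoisotopic residue masses (Da) of the twenty common amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # H2O added for the free N- and C-termini
PROTON = 1.007276   # one proton for the singly charged ion


def mh_plus(peptide):
    """Monoisotopic mass of the singly protonated peptide (assumed convention)."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


for name, seq in (("p1091", "GVYDGEEHSV"), ("p947", "NLTISDVSV")):
    print(name, seq, f"{mh_plus(seq):.2f}")
```

Under this assumption the two peptides named in the Discussion come out at about 1091.46 and 947.50, matching their labels. Because leucine and isoleucine have the same mass, a mass match alone cannot distinguish them, which is one reason the text also relies on retention time and CID data.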

A number of peptides identified here were derived from known tumor
antigens. Those peptides that attracted attention as possibly cancer specific
were chemically synthesized and tested again. The fact that a few of them
elicited a CTL response in mice may point to their possible immunogenicity in
humans.

Tumor proteins from which identified peptides were derived included
mucin (MUC1), a well-studied tumor-associated antigen that is up-regulated in
breast and ovarian carcinomas [48]. A number of HLA-A2.1 restricted
MUC1-derived CTL epitopes were identified by the motif prediction approach
[26, 49-52]. Peptide p947 (NLTISDVSV, SEQ ID NO:8) identified here from
breast carcinoma cells (MCF-7) is the same peptide that was predicted and
confirmed to be an HLA-A2 antigen originating from MUC1 by Carmon et al.
[26].

Another peptide derived from a known tumor antigen was p1091 (SEQ
ID NO:20) from the testis-cancer antigen MAGE-B2. It belongs to a group of
21 known genes that are essentially silent in most normal cells except for testis
and placental trophoblasts, and since different members of the MAGE proteins
are expressed in a variety of tumors, they attracted significant attention as
cancer vaccine candidates [53-57]. A few peptides were identified so far from
the MAGE proteins by the genetic approach and by predicting their sequence
based on the known motifs rather than by the biochemical approach [27, 28,
58-61] (reviewed in [10]). The identification of the novel MAGE-B2 derived
peptide p1091 (GVYDGEEHSV) (SEQ ID NO:20) by the direct biochemical
approach is a very encouraging observation that confirmed the validity of this
method for identification of novel tumor specific antigens. Homologous
peptides from MAGE-A4 and MAGE-A10 proteins were previously identified
as MHC bound peptides and tested for their immunogenicity (see Figure 4D).
This suggests the existence of a possible hot spot within the MAGE protein for
processing as MHC bound peptides [27, 28].

Peptides derived from other proteins that are involved with cancer
progression and may also serve as candidates for anti-cancer vaccines or
diagnosis include p913 (SEQ ID NO:5) from β-catenin, which is normally
involved in cellular adhesion and signal transduction and acts as a
transcription enhancer with a possible oncogenic role in colorectal cancer.
Abnormally high amounts of the protein were found in the cytoplasm of cancer
cells instead of the intercellular boundary in normal cells and this abnormal
behavior was correlated with metastasis [62-64]. Peptides p1145 (SEQ ID
NO:23) and p1258 (SEQ ID NO:24) are derived from fatty acid synthase (FAS),
a biosynthetic enzyme that is expressed in liver and lactating breasts and is a
marker of poor prognosis when expressed in colon, prostate, ovarian, breast
and endometrial cancers. Its significance for cancer was established by
inhibiting its activity, which leads to apoptosis in cancer cells [65-69]. The
enzyme DNA methyl transferase (MTDM), which is highly expressed in
different cancer cell types, including prostate and breast [70-72], is the
source protein for p1028 (SEQ ID NO:13). Increased MTDM activity is usually
associated with tumor progression and is considered to be an important event
in cell transformation [71, 73].

Once tumor specific MHC bound peptides are identified and their
ability to stimulate an immune response is demonstrated, such peptides
become candidates for adoptive immunotherapy. Identification of peptides
originating from normal proteins that are uniquely expressed in non-vital
organs, such as breast, prostate and ovaries, can become very useful for
immunotherapy of these cancers. The potential usefulness of identified
immunogenic peptides should be evaluated by the presence of specific T cells
directed against them in patients afflicted with the particular cancer using
standard assays such as ELISPOT and CTL. The assay of immunizing mice
with the peptides described herein was meant to serve first as validation that
these peptides are indeed MHC bound peptides with affinity for HLA-A2.1
and second as a preliminary indication of their immunogenic potential.

Secreted soluble MHC such as described herein can also be used for
analysis of peptides presented by cells involved with pathologies other than
cancer, such as autoimmune diseases and viral infections, with the aim of
identifying peptides of significance for treating these diseases. The method
can also be used for identification of MHC bound peptides presented on
normal cells of specific tissues, peptides presented by particular MHC alleles
and peptides originating from expression of particular proteins of interest.
Moreover, the approach can be used for analysis of MHC bound peptides
derived from over-expression of specific proteins, from induced mutations or
as a result of metastasis progression, and as a way of searching for peptides
derived from signal peptides of cell surface proteins. The approach described
in this study is also useful for comparisons between patterns of MHC bound
peptides induced by minor changes in the cells' growth conditions such as the
addition of hormones, the expression of a foreign protein or stress
conditions.

Therefore, an appealing outcome of the methodology described herein
is that the simple expression of different recombinant MHC molecules in
different cell lines in a soluble, secreted form and their easy recovery from the
growth medium with their peptides still attached, followed by comprehensive
analysis of the peptides, may become a good staging point for the above listed
ambitious research projects. Such 'human MHC-peptide projects' are worthy
goals to follow the human genome and proteome projects.

It is appreciated that certain features of the invention, which are, for
clarity, described in the context of separate embodiments, may also be
provided in combination in a single embodiment. Conversely, various features
of the invention, which are, for brevity, described in the context of a single
embodiment, may also be provided separately or in any suitable
subcombination.


Although the invention has been described in conjunction with specific
embodiments thereof, it is evident that many alternatives, modifications and
variations will be apparent to those skilled in the art. Accordingly, it is
intended to embrace all such alternatives, modifications and variations that fall
within the spirit and broad scope of the appended claims. All publications,
patents and patent applications mentioned in this specification are herein
incorporated in their entirety by reference into the specification, to the same
extent as if each individual publication, patent or patent application was
specifically and individually indicated to be incorporated herein by reference.
In addition, citation or identification of any reference in this application shall
not be construed as an admission that such reference is available as prior art to
the present invention.
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